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ABSTRACT 

This paper describes the definition of a typical next-generation space-based weak grav- 
itational lensing experiment. We first adopt a set of top-level science requirements from 
the literature, based on the scale and depth of the galaxy sample, and the avoidance of 
systematic effects in the measurements which would bias the derived shear values. We 
then identify and categorise the contributing factors to the systematic effects, com- 
bining them with the correct weighting, in such a way as to fit within the top-level 
requirements. We present techniques which permit the performance to be evaluated 
and explore the limits at which the contributing factors can be managed. Besides the 
modelling biases resulting from the use of weighted moments, the main contributing 
factors arc the reconstruction of the instrument point spread function (PSF), which is 
derived from the stellar images on the image, and the correction of the charge transfer 
inefficiency (CTI) in the CCD detectors caused by radiation damage. 

Key words: Cosmology: observations, cosmological parameters; physical data and 
processes: gravitational lensing; space vehicles: instruments; methods: statistical 
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1 INTRODUCTION 

In the current "Concordance Model" of cosmology, approx- 
imately three quarters of the energy density of the Universe 
consists of Dark Energy, and one fifth of Dark Matter. If this 
model is correct, the implications are significant, because the 
nature of both these dark components is unknown. If some 
other explanation for the appearance of the Universe is to 
be sought, then the implications are also momentous for our 
current understanding of physics and cosmology. 

Dark Energy is a relatively new entity in our under- 
standing of Cosmology. It has been known since the 1920s 
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that the typical separation between galaxies is growing with 
time - the Universe is expanding. But it might be supposed 
that in a Universe made up of only matter, this expansion 
is decelerating: the galaxies will move apart at a decreasing 
rate owing to their mutual gravitational interaction. How- 
ever, a little more than a decade ago, observations compar- 
ing different distance measures for supernovae (Riess et al, 
1998; Perlmutter et al., 1999) revealed that this is not the 
case, and the expansion is in fact speeding up. The cause 
of this acceleration is unseen, but has the characteristics of 
an extra energy density in the Universe; hence we label the 
entity as "Dark Energy". The importance of Dark Energy 
can scarcely be exaggerated. Most immediately, it represents 
the largest source of energy density in the Universe, ~ 75%. 
It is expected to dominate the future dynamics of the Uni- 
verse, so the origins and nature of the Universe cannot be 
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understood without some assessment of what Dark Energy 
is and what its physical characteristics are. 

The next most significant constituent of the Universe, 
Dark Matter, exceeds the normal baryonic matter in a ratio 
of four or five to one. As Dark Matter structures form under 
gravitational collapse, baryonic matter follows. Hence the 
Dark Matter drives the formation and evolution of the struc- 
tures we observe directly, because the behaviour of stars, 
galaxies and gas depends on the underlying gravitational 
potential created by it. While Dark Matter apparently in- 
teracts gravitationally in the same way that normal baryonic 
matter does, it seems not to interact through the electromag- 
netic force. Observations must therefore rely on inferring its 
presence through the gravitational effect it has on light or 
baryonic matter, by which means it has been inferred on a 
range of scales, galactic and larger. 

That it is critical to achieve an understanding of the 
nature of Dark Energy and of Dark Matter, and of the ex- 
amining and testing the alternative conceptual structures 
for the observed characteristics of the Universe, has been 
recognised for some time. Summaries are available in the 
DETF and ESA/ESO reports (Albrecht et al, 2006, Pea- 
cock et al, 2006), the ASTRONET Infrastructure Roadmap 
(Bode, Cruz & Molster 2009) and most recently by the US 
Decadal Survey report (Blandford et al, 2010). 

Initiatives are under way to further the accuracy and 
precision of the observations in order to address these 
questions, using facilities on ground (for example BOSS - 
Schlegel et al, 2009; BigBOSS - Schlegel et al, 2011; KiDS 

- de Jong et al, 2012; DES - DES Collaboration 2005, 
HSCQ, LSST - Tyson et al, 2003 and SKA - Blake et al, 
2007) and in space (Suc/irf - Laureijs et al, 2011; WFIRST 

- Green et al, 2012). These employ a combination of tech- 
niques, including weak gravitational lensing, galaxy cluster- 
ing (which incorporates baryonic acoustic oscillations) and 
supernovae, among others, both to distinguish between pos- 
sible cosmologies, and also to ensure that systematic effects 
in the measurements are identified and quantified at the re- 
quired level of accuracy. Control of systematic effects is crit- 
ical. Because of the inherently stable conditions that can be 
achieved, space missions provide the best opportunities for 
controlling systematics, and payloads can be designed also 
to include the capability to make observations using several 
techniques. 

Weak gravitational lensing uses statistical measure- 
ments of the distortions of galaxy shapes to study the clus- 
tering of matter in the Universe. Early studies were made by 
Wittman et al, (2000), van Waerbeke et al, (2000), Mellier 
et al, (2000), Bacon, Refregier & Ellis (2000) and Kaiser, 
Wilson & Luppino (2000) . Reviews can be found in Hoekstra 
& Jain (2008) and Munshi et al, (2008), while more recent 
work includes that by Schrabback et al, (2010) and Hey- 
mans et al, (2012). The rate at which the large-scale struc- 
ture has grown depends on the expansion rate of the uni- 
verse, so the nature of the acceleration can be characterised 
by making these shear measurements at different redshifts, 
looking back in time (Hu, 1999). 

In this paper we develop a framework by which weak 
lensing measurements in particular can be realised in a 



space mission, and its likely performance anticipated (many 
of these considerations apply also to ground-based weak- 
lensing surveys). The formalism for the critical systematic 
effects has been developed in a series of papers (Vale et 
al, 2004, Mandelbaum et al, 2005, Huterer et al, 2006, 
Stabenau et al, 2007, Amara & Refregier, 2007, 2008, 
Paulin-Henriksson et al, 2008, Kitching, Taylor & Heavens 
2008a, Amara, Refregier & Paulin-Henriksson 2010) with 
the most contemporary development given in Massey et al, 
(2012a) (hereafter MHK12), and we use these here as a ba- 
sis. 

The work was carried out in the framework of the Eu- 
clid missior0, under the auspices of the European Space 
Agency Cosmic Vision programme. An overview of its ca- 
pabilities can be found in the Euclid Red Book (Laureijs et 
al, 2011; this a consolidated summary of the mission at the 
end of the Definition Phase) and Amendola et al, (2012). 
With respect to its weak lensing capabilities, Euclid can be 
considered an example of a next-generation cosmic shear 
survey mission. Euclid is designed to carry out both weak 
lensing and galaxy clustering cosmological measurements, 
using a payload comprising a visible imager, with which the 
weak lensing measurements are made, and a near-infrared 
spectrograph-imager. For visible measurements, CCD de- 
tectors are the currently leading technology for large focal 
planes, and they are assumed for this paper. However, the 
methodology by which we address the realisation of a suc- 
cessful experiment is general, and although we will some- 
times use EucUd as an example, the purpose of this paper is 
to set out the applicable principles. 

Section 2 of this paper sets the requirements and de- 
scribes how an allocation can be made to the main fac- 
tors contributing to the performance degradations. Section 3 
briefly describes simulations and data processing. How the 
weak lensing performances may be evaluated is set out in 
section 4. 



2 SETTING THE REQUIREMENTS 
2.1 Mission Driving Parameters 

The power of a weak lensing survey depends on five main 
factors: 

(i) the size of the survey; 

(ii) the limiting magnitude of the survey; 

(iii) the size and shape of the instrument point spread 
function (PSF); 

(iv) how well this PSF is known and 

(v) how well we can correct for the sources of systematics. 

Parameters (i) and (ii) set the total number of galaxies 
that may be available for the weak lensing shear measure- 
ments, and their range in redshift. Hence they set the max- 
imum achievable statistical precision. This drives the area 
of the sky that should be observed, and consequently the 
field of view of the instrument, and the mission duration. 
A wide survey is required to ensure the measurements are 
representative of the observable Universe. A deeper limit- 
ing magnitude, by providing increased signal-to-noise ratios 
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on each individual galaxy measurement, also determines the 
size of the sample, and enables higher redshifts to be ac- 
cessed. Given that most of the cosmic acceleration has taken 
place in more recent epochs, the emphasis of most lensing 
surveys is on galaxies with redshifts z ~ 2 which makes 
them sensitive to structure at z ~ 0.5 to 1, halfway be- 
tween source and observer. Mitigating the confusing effects 
of intrinsic alignments between galaxies (resulting from the 
flows of material during the formation of structure in the 
Universe) also requires a sufficient survey depth (Joachimi & 
Bridle, 2010). Because the lensing signal is cumulative along 
a line of sight, the more distant sources contain information 
about Dark Energy at low redshift as well as the informa- 
tion about the growth of structure at high redshift. More 
distant galaxies generally are, however, fainter and smaller, 
which makes the measurement of their shear from the weak 
gravitational lensing more difficult for a given instrumental 
PSF. Measuring their redshift is, in addition, more difficult. 

The depth of the survey drives the collecting aperture of 
the telescope, its throughput, the width of the observational 
bandpass, and the sensitivity of the detectors. Achieving a 
desirable size and shape of the PSF drives all of these con- 
tributors: a larger telescope reduces the size of the PSF, the 
optical design drives its shape, the stability of the satellite 
pointing modifies the PSF, and the need for adequate sam- 
pling of the PSF drives the detector pixellisation to be small. 
This, in combination with the need for a large field of view, 
requires a large detector matrix. 

The overall "system" PSF is a combination of PSFs 
produced by the optical system, satellite pointing stability, 
detector pixellisation and detector effects. The detector ef- 
fects arise as a result of the physical realisation of the de- 
tectors (for example charge spreading in the pixel grid) and 
because of damage effects in space, particularly radiation 
damage (Holland et al, 1990). The first three contributions 
to the total PSF can be modelled by convolutions, while the 
fourth, generated by the detector, generally has a combina- 
tion of characteristics, only some of which can be modelled 
by a convolution. 

Limitations on the number of detector pixels may drive 
the observing strategy, for example requiring multiple expo- 
sures to recover spatial resolution from undersampled im- 
ages (Section I4.2.1|l . This also has the benefits of allowing 
cosmic rays to be detected and removed in the data pro- 
cessing. If there are small displacements between the dif- 
ferent exposures, then additionally, the impact of cosmetic 
defects in the detectors can be minimised, and the gaps be- 
tween the individual detectors can be filled in. Because of 
the detector gaps, some galaxies will have more exposures 
than others, and the effect of this on the best-fitting cos- 
mological parameters requires evaluation. With somewhat 
larger displacements, perhaps up to half of a detector, a 
further benefit is obtained in that the radiation damage ef- 
fects (Section 13. 5|) which increase with distance from the 
CCD readout node, are formally separable from the cos- 
mic shear. Typically, therefore, more than one exposure is 
taken of each field, and these will be combined to reach the 
depth required for the survey. The multiple exposures how- 
ever impact operational considerations negatively, require 
more fuel for the spacecraft repointing and require more 
telemetry bandwidth. 



For a typical advanced weak lensing survey, (such as 
that discussed in Laureijs et al., 2011), ~ 2n sr will be cov- 
ered to a depth AB ~ 25 at lOcr, yielding ~ 30 galaxies 
arcmin"^ with suitable characteristics for the survey - a 
total exceeding 10^ galaxies. The survey will generally be 
limited to Galactic latitudes \l\ ~ 30°, and will concentrate 
at least initially on regions furthest from the ecliptic plane, 
in order to minimise the Zodiacal background light. The 
pattern with which the fields are exposed will generally be 
constrained in order to maintain stable conditions within 
the payload. 

The survey and instrumentation must also be planned 
to minimise the systematic biases in the weak lensing mea- 
surements. Factors (iii) and (iv) strongly impact these sys- 
tematic biases, and therefore on the accuracy (as opposed 
to the precision) of the measurements. The size and shape 
of the PSF influence which fraction of the observed galaxies 
may be useful for shear measurements. The PSF blurs im- 
ages: the shape of galaxies with smaller sizes relative to the 
PSF will be measurable with reduced accuracy and hence 
smaller PSFs are desirable. For a given encircled energy 
width, a PSF with broad wings and narrow core will have a 
different effect on the shape measurement from one with nar- 
rower wings and a broader core. In addition, the detection 
limit of the survey will depend on the PSF, which influences 
the sample to some extent. 

Within these general constraints, the typical PSFs nor- 
mally achieved with standard astronomical instrumentation 
in space-borne observatories will be acceptable (and cer- 
tainly smaller and more stable than through a turbulent at- 
mosphere). The particular and stringent aspect of the weak 
lensing measurements is contained in the fourth and flfth 
main factors: the knowledge of this PSF, and how the biases 
can be corrected. The ultimate power of the weak lensing 
measurements will depend on the level with which the PSF 
is known, and future generation weak lensing surveys such 
as those considered in MIIK12 require this to be known to 
an unprecedented level of accuracy. Included in this knowl- 
edge is the way the PSF will change with time, with position 
on the focal plane and with source galaxy characteristics. 

MIIK12 provides the top-level context for this investiga- 
tion, while this paper provides a more detailed examination 
of the multiple contributing effects for each bias, and how 
they might be combined in a practical experiment. 



2.2 Quantifying the Biases 

In a typical future generation weak lensing survey from space 
(for example, Laureijs et al., 2011), with observations of > 
10^ galaxies, the errors on the linearly varying Dark Energy 
equation of state (Chevallier & Polarski, 2001; Linder 2003) 
w{z) = Wp + Wa{z — Zp)/[{1 + Zp){l + z)] are Wp ~ 0.05 and 
Wa ~ 0.2, to give a Figure of Merit (FoM) l/[AwpAwa] ~ 100 
from lensing only {zp is the redshift at which the error on 
w{z) minimises). As the precision increases as a result of 



It should bo recalled that the FoM is only one of the measures 
used for the effectiveness of Dark Energy investigations and the 
linear parameterisation in ui also is limiting. The form of the 
structure growth factor (Laureijs et al, 2011) and other measures 
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combining such large samples of galaxies, the control of the 
systematic effects becomes more and more important. 

Effective control of the systematic effects requires first 
an understanding of what effects may be present, and how 
they combine with each other to introduce biases. Then it is 
important to understand how significant each effect may be 
in the overall performance. Some effects can be minimised 
by better design of the instrument and survey, and by bet- 
ter calibrations; others by alternative approaches in the data 
processing and analysis. Each of these carries implications 
for the viability of the experiment and for the cost and dura- 
tion of the mission. For example, improved control of some 
biases may be achieved through newer technologies which 
carry more risk. Alternatively more conventional technolo- 
gies could be used and the gains sought in the data analysis 
algorithms. 

In this section we first summarise how the biases affect 
the derived cosmological parameters. Then we identify the 
factors which contribute to these biases and quantify their 
relative importance. Each factor generally has contributions 
from other sources. We organise these into a structure which 
allow the effect of each to be assessed; this attempts also 
to clarify the relationships of the contributing factors. In 
respect of each lowest-level factor, an initial analysis may 
suggest that a certain level of knowledge can be reached, 
but these may require revision in order to remain below the 
permitted total bias, which will lead to further more detailed 
analyses. The purpose of this section is not to identify the 
values of the factors for any particular experiment, but to 
rather illustrate a structure by which the performance of an 
experiment in terms of the control of systematic effects can 
be assessed, and the effects of changes in any aspect can be 
propagated to the top level. This allows the optimisation of 
the experiment to be achieved. 

The procedure for quantifying the biases is as follows. 
MHK12 and references therein consider that the true shear 
7 of a galaxy will differ from that actually measured, 7, by 
additive and multiplicative biases c and m (in the survey, 
instrument and measurement process) as 

7 = (1 -f m)7 4- c. (1) 

The two-point ellipticity correlation function is 

= {-if^fW), (2) 

where 6 is the angular scale and i, j refer to redshift bin pairs 
averaged over all pairs of galaxies A,B. This can be used 
(Hu, 1999) to constrain a set of cosmological parameters 
usually through the corresponding Fourier transform power 
spectrum Cij{£). As a consequence of the biases c and m in 
Equation [B Cij{£) will be modified (Kitching et al., 2012) 
by additive A and multiplicative A4 biases into an observed 



Cij is a function of spatial scale £ 
MIIK12 (equation 22) show that 

A = a'[\c\] 

M=2{m) + {mf +a^[m]. 



(3) 

2n/9 and redshift. 

(4) 



to be tested for the cosmology arc also relevant. However, the FoM 
is a standard generally used for the comparison of surveys. 



Non-zero A and A4 lead to a bias in the maximum 
likelihood values of measured cosmological parameters (see 
MIIK12) and an decrease in the FoM (through an increase 
in the covariance). As noted above, the contributors to (j[\c\] 
and m must be derived through a careful process of identify- 
ing all of the biases, including the imperfections in the galaxy 
modelling and other effects. The requirement adopted by 
MHK12 (shown in their figure 3) is that the systematic bi- 
ases from both A and A/f be ^ 0.31 of the random errors 
contributing to dj. To achieve this at the 95% confidence 
level requires 



A 10"' 
X 4.0 X 10" 



a[\c\] <: 10-' 
2{m) < 4.0 X 10"^ 



(5) 



We adopt the formulation in section 3.3 of MHK12, 
based on that in Paulin-Henriksson et al., (2008), Paulin- 
Henriksson, Refregier, & Amara, (2009): 



A 



p2 p2 p2 

-^7 ^R^ec 
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Rt 
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( jSiKc)) 
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-'l^R^ec 



R^ 



(6) 



where 



Rgs.1 
-R^ 



and then also 
2 / R^ 



PRRf 



'gal 



-Pfl^gal 



+ RI 



{sprY 



(7) 



^ -P-R 
1 



+ 



+ 



p2 
^R 



where 
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RU 

Rq 

-"gal 



(R'c) 



4-4- 



(Robs) 



(Rib.) 



(8) 



RohsiRo 



rtgal 



Rr., 



Rl 



SR., 

b7 



+ 



R„ 



SPe 



PrR. 



■gal 



Rl Pr 



(9) 



where R refers to the size of the PSF (convolution kernel) 
or galaxy image and the e to the polarisation, generally 
referred to as the 'ellipticity', defined in terms of the un- 
weighted second order moments in the image of the galaxy 
(Seitz & Schneider 1995, Bonnet & Mellier 1995). Exphcitly, 
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for a PSF ^{xi,Xj) and a weight function w{xi,Xj) these 
moments are 



Q^. 



J J ^{xi, Xj)'w{xi, Xj){xi — Xi){xj — Xj)dxidxj 



J J ${xi,Xj)w{xi,Xj)dxidxj 



Then, size 

= Qii + Q22 
and, eUipticity 

e = [ei, £2! = 



(10) 



(11) 



?ii — Q22 Q12 + Q21 



i?2 



i?2 



|e| = \/el+el 



(12) 



Returning to Equations [6] and [H the subscript c refers to 
those components of the PSF which can be combined by 
convolution, and nc to those that cannot. MHK12 use the 
subscript psf rather than c, but we use c to distinguish 
the convolutive part of the system PSF clearly, and as a 
reminder that we consider the term PSF here to refer to the 
end-to-end system PSF. 7?gai and i?obs refer to the original 
and observed size of the galaxy, relating as (MHK12) 



(13) 



(14) 



-Robs = y (-Rgai + RD + Rf^c ■ 

The 'shear polarizability' 

relates the eUipticity to the shear in the galaxy image caused 
by the weak lensing, and, although to some extent depen- 
dent on the galaxy sample and wavelength range used for the 
survey, when aggregated over galaxy samples out to z > 2 
is found to be approximately a constant factor 1.86 (Leau- 
thaud et al. 2007). 

The quantities Pr, P^^ and Pe^^c compensations for 
the necessity of using weighted quadrupole measurements 
rather than the unweighted moments that would be ideal 
in theory (see MHK12 for details). The weighting function 
w{xi,Xj) is introduced inside the integrals in Equal ion [10] to 
control the increasing noise fraction as the integration moves 
outwards from the centre of the galaxy or star image. Indeed, 
the use of weighted moments is the origin of the additional 
and fj, terms in Equations [S] and |S] by comparison with 
Equation 31 We define 

r»2 _ p rf2 

-'•'unweighted -^-R -^weighted 

S^unwcightcdc ^wcightcdQ 
^^unwcightcd^c -fejvC ^wcightcdNC ■ 

(15) 

In general P^c and Pei^c — 1 while Pr, is larger (in the 
Euclid case, Pr ~ 2). 

The system PSF properties change with wavelength, the 
spectral energy distribution (SED) /(A) and the transmis- 
sion as a function of wavelength r( A) . The size and eUipticity 
of the convolutive components of the PSF are given by: 



R^ 



and 



— / dAT(A)A/(A)i?^(A) 



rfAT(A)A/(A)e^(A), 



(16) 



(17) 



where /tot = J d\T{X)\f{\) is the total number of photons. 
Note the extra factor A converts from energy to photons. 



In Equations |S] and [S] (i^P) terms have been decom- 
posed into a systematic bias of a model value away from 
the truth (5)^, and uncertainties in this. To elaborate 
this important point, many biases in the measurements will 
be corrected by detailed modelling: an example may be the 
change in the size of the PSF resulting from out-of-band 
leakage, which may be calculated from the measured charac- 
teristics of the bandpass and the spectral energy distribution 
of the particular source. This model calculation will not, of 
course, produce exactly the true value. From uncertainties 
in the inputs to the modelling (in the example above this 
might include the uncertainties in the transmission at each 
wavelength), and inadequacies in the physical model (again 
in this example, this might arise from codes used to predict 
a stellar spectrum), the modelling will produce a slightly in- 
correct prediction (S) , and an associated uncertainty on this 
prediction a. 

We can write Equations [6] and [8] as follows: 



+ «2 {{Se^c)'^ [|£ncI]) 
-'[Rl] 



+ 



„ ( {S{R.c)f 

- as {{5{a)f + a^[a]) 



+ 



Ri 



(18) 



and 



M = m 



+ m2 



(ml)) 



+ ms 



+ rrbA 



Rt 



•'Mr 



+ 7715 m^i))+Cj[^]) 



(19) 



Recall that because A — A{£) and Ml — A4{£), in general 
these coefficients are also functions of angular separation or 
scale. 

To reiterate, the last terms of Equation[6]and|8]relate to 
the modelling error for the galaxies resulting from the fact 
that weighted, rather than unweighted quadrupole moments 
are used in practice. We have swept up all of the additive 
galaxy modelling errors in Equation [H] into a model error a, 
split into bias and knowledge errors {S{a)f and a^[a]. We 
have done the same for the multiplicative errors in Equa- 
tion|5] with the model error split into {5{fj,)) and alfi], so 
that 



and 



(20) 



In practice, the a terms in Equation [20] can be reduced to 
insignificant levels by ever-larger simulations, but these are 
ineffective in correcting for the S terms. Hence we will ignore 
the cr'^[a] and a[fj] terms in carrying forward any allocations 
for imperfections in the modelling. 
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Also, we should note at this point that the first term in 
Equation O 



5P^ 







for a sufficiently large survey. The third term captures the 
error which will arise from the use of the weighting func- 
tion in the quadrupole moment integral f Equation I10|l as a 
result of the absence of perfect knowledge of higher-order 
multipoles, while the second term is a knowledge error aris- 
ing from imperfect measurements, and hence is dependent 
on signal-to-noise ratios. 

Note also that Equation [TH] differs slightly from Equa- 
tion|6]to the extent that we introduce 5 terms in the first two 
lines. This accommodates the biases that will, in practice, 
occur in a weak lensing experiment which employs the use 
of bright stars to define the PSF used for faint galaxies, with 
the associated non-linearity and wavelength mismatching. 

So, Equations [TS] and [T^ contain terms for knowledge 
bias (5)^ and knowledge uncertainties ct^ in the following 
categories: convolutive, and non-convolutive errors in the 
PSF sizes; convolutive, and non-convolutive errors in the 
ellipticities; and bias errors a and on the transformation 
from ellipticity to shear resulting from the fact that we use 
weighted moments of the PSF. The coefficients Oi and mi are 
now seen to be weighting factors, whose values depend on 
the characteristics of the instrument and the galaxies being 
measured. Equations [TS] and [T^ provide the prescription by 
which these contributing effects can be combined. 

We are now in a position to quantify the impact on the 
cosmology from systematic effects in the weak lensing mea- 
surements given the knowledge biases (5), knowledge uncer- 
tainties o and weighting functions Ui and rrii. 



2.3 A Hierarchical Structure by which Systematic 
Effects may be Identified, Evaluated and 
Controlled 

We start by identifying lower-level contributing factors 
which might contribute to the knowledge biases and uncer- 
tainties. These may be grouped into categories, such as the 
imperfect knowledge of the source characteristics, calibra- 
tion errors, residual effects in correcting for detector effects, 
and the imperfect modelling of the PSF itself. Some cate- 
gories will pertain to the instrument design and others to 
the data processing. In order to minimise the interconnect- 
edness of the different factors, and hence to maximise the 
visibility and control of the bias effects introduced by each, 
some thought is required as to this categorisation of factors, 
and to the organisational hierarchy relating the categories. 

Within each of these categories there are several factors 
to be considered: for example, within the PSF modelling 
there are fundamental imperfections of the model, and then 
inaccuracies in the parameters derived for the model result- 
ing from photon statistics and pixelisation effects (which 
need to take account of multiple exposures, if these are 
used). In the category of calibrations, an example may be 
imperfect subtraction of the electronic reference level, an al- 
lowance for the effect of imperfect identification of cosmic 
rays, and so on. In the category of detector effects, an ex- 
ample may be the imperfect correction for radiation damage 



effects or detection chain non-linearity. This process must 
be continued to lower levels: in the last example the further 
contributing factors may include the output node linearity 
of the CCD, the linearity of the analog electronics associated 
with that node, the characteristics of the analog-to-digital 
conversion, and so on. In addition, it may be necessary to 
consider the stability of these different parts, and the accu- 
racy with which any factor can be established - this is also 
connected to the calibrations. 

As the level of accuracy required from the experiment 
increases, more and more factors must be considered, each 
of which will contribute to a degradation in performance. 
At some level, however, the factors become negligible, or 
can be made so by design or through operational strategies. 
For example, the flat flelding of the detector can be made 
unimportant by combining a large number of flat fleld cal- 
ibration exposures. In practice, these less significant effects 
may not be fully evaluated, at least in the early stages of a 
programme. 

We must now quantify the weighting functions a; and 
rrii . The first thing to notice is that we will want to calculate 
these in real space, because directly measured values will be 
used for -Rgai, Rq stc. To relate these to the limits derived 
in Fourier space, the simple scaling from A — >■ o"'^[|c|] and 

— >■ 2 (m) using Equation [5] is used. The a; weighting 
functions will remain unchanged, while the rrii can simply 
be halved. We will therefore make this adjustment, and from 
this point work in real space, using primes to designate the 
real space a'i = m and m'i = mi/2. 

If we assume a limiting magnitude mAB = 24.5, then we 
can adopt Rgai = 0.20 arcsec (MHK12). The value for R^ 
and -Rjyp will depend on the experiment: here we will use 
R^ — 0.22 and Rj^^, = 0.05 arcsec as in the Euclid mission. 
Then -Rgai/-Rc = 0-91' and RohJRc = 1-6- If we further 
set < 0.1, which is generally achievable in practice, then 
(elV^^ < 0.1. We wiU use (e^)^/^ = 0.1 and fix = 
1.86, Pr — 2.0 and = Pefjc = 1.0. We can now calculate 
the ai and m^: 



0.2 = 



1 




p2 p2 p2 


\ -Rgai / 


1 / 


1 2 -R^ 


p2 p2 \ 

1 ejvc \ 




(led') 


m 


p2 p2 p2 





+ 



1 R% 

PIRU 



0.5 = 0-3 

and 

m'l = 

m'2 = 
m-i = mi /4 

7714 — 7712 /4 



= 0.10 

= 0.74 

= 1.0 X 10" 

= 4.2 X 10' 

= 1.0 X 10" 



(21) 





1 Rl 








1 Rl 







Ro 



0.60 



= 0.17 



9.0 X 10"^ 
7.5 X 10"^ 
0.60 



(22) 
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It is evident that the dominant weighting factors to o"[jc|] 
are a'l and a'2, and these act through the PSF ellipticity. 
The lower values of a'g and a'4 allow the knowledge error of 
the PSF size to be more relaxed than the knowledge error 
of the ellipticity. For (m), m'l, acting through the PSF size, 
and m's are dominant, with a contribution from 7712. Hence 
for the additive bias, the ellipticity error will need the closest 
attention, while for the multiplicative bias, the size is more 
important. 

Having identified contributing factors and a way of or- 
ganising them, together with their combinatorial rules and 
weightings, values must now be assigned to these terms in 
Equation 1181 and 1191 in order to quantify their effects and to 
identify which are the more significant. Initially any values 
can be assigned in order to verify the combinatorial rules in 
the hierarchical structure. The next step is to include rea- 
sonable values for the factors. Many different considerations 
will bear on the values adopted for each factor, including 
the mission and instrument design, cost, calibration strate- 
gies, data analysis techniques, risk, organisational resources 
and many others. Once an approach is identified, the factor 
values will typically be established though calculations and 
simulations. In some cases, the tools may not be available 
to realise a value directly and a judgement must be made 
on the basis of experience as to what reasonable allocations 
should be made for each factor, until at some later stage the 
value can be established more quantitatively. These calcu- 
lations may indicate that some factors have disproportion- 
ate effects, while others are easily realised. This generally 
leads to a rebalancing with measures introduced to address 
the disproportionate effects (for example by a change in the 
technology used) and to simplify the approach in respect of 
those factors that are easily realised, until a viable function- 
ing point is achieved. 

In order to illustrate this process with an example, we 
have provided a hierarchical structure in Table [1] This iden- 
tifies how the terms in Equations 1 181 aud i 19l could be related 
to a model of instrumental or data-processing biases and 
uncertainties. Table[T]contains numerical factors for most of 
the categories (we provide example values from the Euclid 
case) . 

There are three broad categories in Table [1] refiecting 
those in Equations [18] and 1191 convolutive contributions; 
non-convolutive contributions; and model bias knowledge, 
with a final direct contribution of the residual spatial dis- 
tortion. Horizontal lines indicate that there is no allocation 
in this category in this example, though in general an alloca- 
tion should be made. Their indentation in the table indicates 
the organisation of the contributing terms. The penultimate 
of the broad categories contains the allocations within a and 
H of method errors, reflecting the final lines in Equations 1 181 
and ll9l However, we have not included bias error allocations 
a'^[a] and as these can be minimised by simulations. The 
simulations will not reflect reality at some level, so there are 
residual knowledge biases, the remaining terms {S{a))^ and 
{5{jj,)}, for which an allocation is assigned. 

It is easiest to follow the combination rules in Table [T] 
by starting from the right-most values. In the absence of any 
other information, the factors can be combined to calculate 
the next value in the hierarchy: quadratically if they are in- 
dependent, or linearly if they are not. They have therefore 
been prefixed with a Q or an L. Eventually, the combina- 



tions will lead to a row labelled as a 5 or a a, depending 
on whether categories of knowledge bias or knowledge un- 
certainty have been combined. Now these knowledge errors 
must be combined with the weighting factors a'j and in 
Equations [TS] and 1191 so the a and S values are multiplied 
by their corresponding a'^ and m'i to produce the values in 
the second column. These are then added, as prescribed in 
Equations [TH] and 1191 together with the residual distortion 
after correction factor, to provide the final values for (j[\c\] 
and (m) . Note that because the values are in real space, the 
total multiplicative bias is halved in converting from (m) to 
A4, as prescribed by Equation (5] and also that because the 
table propagates a rather than a^, due regard is required in 
calculating A. 

We will evaluate the values in Table [T] in the subsequent 
sections of this paper to examine whether they are reason- 
able to use as a basis for the knowledge bias and knowledge 
uncertainties that may be achievable in a practical exper- 
iment. The values assigned to the different factors will be 
different for different experiments, and we emphasise that 
the purpose of Table[T]is not directly to prescribe any values 
in particular, nor primarily to justify the values used, but 
rather to provide a conceptual structure for a weak lensing 
experiment in space. 

2.4 Absolute Characteristics of the PSF and 
Further Breakdown within Subsystems 

Recall that the PSF characteristics (factor (iii) in the 
discussion in Section 12. 1|) must be suitable. In particular 
this applies to the PSF size in terms, and its elliptic- 
ity e. In addition to R, F, the full width half maximum 
(FWHM) is also often used to provide an additional con- 
straint on the PSF, specifying the width of its core. This is 
primarily to guard against PSFs which might be problematic 
in some characteristics while nevertheless conforming with 
the requirement on R (for example annular PSFs, resulting 
from an out-of- focus condition). The PSFs can be evaluated 
using standard procedures to examine whether they meet 
the requirements. The only non-standard component in the 
breakdown is the contribution of the detector radiation dam- 
age effects to size and ellipticity, as these are not generally 
rendered in size and ellipticity terms. However, they can be 
calculated using Equations 1101 - [T^ 

When further breaking down contributions within the 
system, for example to constrain the individual contributors 
of different subsystems within the experiment to the overall 
system PSF, care is required in their combination. In partic- 
ular, the ellipticities must be weighted taking into account 
the values of the FWHM Fi for that particular contribution 
to the PSF: this is because a contributing factor may be in- 
trinsically strongly elliptical, but with small associated Fi it 
will be relatively unimportant. A reasonable approximation 
by which to combine the ellipticities is as 

etot = ^(i^'/J'tot)ei (23) 

i 

for the terms that can be represented by a convolution and 
etot = ^£, (24) 

for those that cannot. The knowledge residuals in Table [T] 
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<t[\c\] 1.3 X IC' Additive bias {='/A 3.2 X 10"'' - Equation^ 

(m.) 1.5 X 10~^ Multiplicative bias (= X/2 2.0 X 10"^ - Equation 

\eQ I a'j 7.6 X 10~^ Weighted convolutive contribution to ellipticity 

; cr : 2.3 X 10"'' PSF model ellipticity knowledge [Figure^ 

: : Q 2.1 x 10~* PSF model truncation error [Figure\6jj 

: : Q 9.7 X 10"^ PSF model coefficient error 

: : ; Q 7.0 X 10"^ Photon noise 

: : : Q 6.7 X 10~^ PSF model coefficient calibration error 

: : : ; Q 3.0 x 10~* Bias residual 

: : : : Q 3.0 X 10"^ Flat field residual 

: : : : Q 3.0 x 10~® Sky subtraction residual 

: : : : Q 3.0 x 10~® Linearity residual 

: : : : Q 3.0 x 10~® Cosmic Ray subtraction residual 

: : : Q Pixclisation elTects [Figure{Tj 

: : Q Spacecraft pointing 

: : ; ; Q Displacement between guider and instrument 

: : ; ; Q Guider measurement precision 

: 5 : 5.2 X 10"^ Model transfer to object 

: : Q 3.5 X 10~* Wavelength variation of PSF contribution 

: : Q 3.5 x 10~^ Linearity residual 

: : Q 1.0 X 10~® Out-of-band transmission fFiaure \ 1 1( 

ag 2.2 X 10"^ Weighted convolutive contribution to PSF size 

m'^ 3.0 X 10"* Weighted convolutive contribution to PSF size 

mg 2.1 X 10"* Weighted convolutive contribution to PSF size 

: cr : 4.8 X 10"" PSF model size knowledge [Figure^ 

: : Q 3.0 X 10"'* PSF model truncation error 

: : Q 3.7 X lO"' * PSF model coefficient error 

: : : Q 3.0 X 10"* Photon noise 

: : : Q 2.2 X 10~* PSF model coefficient calibration error 

: : : : Q 1.0 X 10"" Bias residual 

: : : : Q 1.0 x lO""* Flat field residual 

: : : : Q 1.0 x 10~* Sky subtraction residual 

: : : : Ql.OxlO"* Linearity residual 

: : : : Q 1.0 X 10"* Cosmic Ray subtraction residual 

: : : Q Pixclisation effects [Figure{Tj 

: : Q Spacecraft pointing 

: : : Q Displacement between guider and instrument 

: : : Q Guider measurement precision 

: 5 : 5.0 X 10~* Model transfer to object 

: : Q 3.5 X 10~* Wavelength variation of PSF contribution 

: : Q 3.5 x 10~* Linearity residual 

: : Q 2.0 X 10~^ Out-of-band transmission [Figure [TTI 

le^cl 02 9-5 X Weighted non-convolutive contribution to ellipticity (CTI) 

: : 1.1 X 10"'' Non-convolutive contribution to ellipticity (CTI) [Figures [Sl - [TOf 

: : cr : Model calibration error 

: : 5 : Fidelity of physical model 

R^Q a'^ Weighted non-convolutive contribution to size (CTI) 

rrij Weighted non-convolutive contribution to size (CTI) 

Weighted non-convolutive contribution to size (CTI) 

: : Non-convolutive contribution to size (CTI) 

: : (T : Model calibration error 

: : S : Fidelity of physical model 

a 5.2 X 10"'^ Weighted additive model bias knowledge [MHK12] 

: 5 : 5.0 X 10~* Additive model bias knowledge 

: : L Model error on simulated data with monochromatic galaxies 

: : L Residual CTI correction error 

: : L 4.2 X 10~^ Colour gradients [Semboloni et al, (2013)] 
^ mg 1.2 X 10~^ Weighted multiplicative model bias knowledge [MHK12] 

: S : 2.0 X 10~^ Multiplicative model bias knowledge 

: : L 1.0 X IQ-^ Model error on simulated data with monochromatic galaxies 

: : L 5.0 X 10~* Residual CTI correction error 

: : L 5.0 X 10"'' Colour gradients [Semboloni et al, (2013)] 

: 3.0 X 10~^ Residual distortion 

Table 1 . Potential contributors to the knowledge error terms in Equations 1181 and 1191 Where the allocations are preceded by Q they 
terms are added quadratically to generate the next level up and where they arc preceded by L they are added linearly. In the higher 
levels (if there are no Q or L designators) they are weighted as in Equations 1181 [191 1211 and 1221 to provide final values for y/A. and M/2. 
See text for details. Certain entries contain references to figures in later sections, where there is an evaluation of their feasibility. 
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should be similarly combined: 

<7^[£totl = ^(Fi^/Fl,,)''^'N (25) 

i 

for the terms that can be represented by a convolution and 

a^[£totl = Ea^[e,] (26) 
J 

for those that cannot. While all Si in Equations 1231 and 1241 
have equivalent Fi categories, this may not be the case for 
the (j[ei] ellipticity knowledge residuals in Equations 1251 and 
1261 In this case an appropriate mapping must be assigned, 
possibly through experience. 

2.5 Recapitulation 

To recap, the area covered by the survey, and its limiting 
magnitude, together with the overall characteristics of the 
PSF, will set the number of galaxies that can be used for 
the weak lensing measurements. This will provide a level of 
random error. In order to achieve a systematic error which 
is some moderate fraction of this random error set by the 
Poisson noise, a requirement then arises for the control of 
systematic effects. This can be apportioned between addi- 
tive and multiplicative effects, the impact of each of which 
depends on lower level uncertainties - the weighted indi- 
vidual knowledge bias (S) and knowledge uncertainties a of 
the convolutive, non-convolutive and model error size and 
ellipticity. 

For the practical experiment, an evaluation must be car- 
ried out for these individual component factors, to establish 
whether they are achievable using techniques at hand, or 
from reasonable projections of what techniques may become 
available in the timeframe of the mission. The most signif- 
icant factors must be quantified through an appropriately 
detailed assessment, for which Table [T] may be used as a 
starting point. The evaluation is likely to entail large scale 
simulations and evaluations. These will establish whether 
the value assigned to the different factors can be reached in 
any concrete design or procedure. If not, the values can be 
adjusted to relax the constraint on any one contributing fac- 
tor, but then others must be tightened accordingly, in order 
to remain within the required levels of A and A4. In this 
way, a balance may or may not be achieved, depending on 
the characteristics of the mission, with implications for its 
feasibility. 

So far we have provided the rationale by which the sys- 
tematic biases can be identified, organised and their com- 
bined effects evaluated, and provided an example hierarchi- 
cal structure in Table [T] Table [ijcontains example numerical 
values for the different factors, derived from the Euclid pro- 
gramme. We will examine in Section |4] and beyond some of 
the methods by which these values can be calculated, with 
the aim of illustrating the process, rather than providing a 
justification for any particular case. To do this, we first need 
to create simulated data, and process these in a representa- 
tive manner, so we discuss briefly how these may be done, 
concentrating on aspects of particular importance to a weak 
lensing experiment. 



3 SIMULATIONS AND DATA PROCESSING 
3.1 Procedure 

To quantify the impact of biases we must create simulated 
data with the appropriate level of fidelity, incorporating in- 
formation from laboratory and prototype tests. The simula- 
tions typically proceed as follows: 

(i) The telescope model is perturbed in terms of as- 
manufactured optical quality and optical alignment to pro- 
duce many different instances of the optical PSF at different 
locations on the field-of-view on a finely sampled grid. For 
part of the field of view, a finer sampling of field-of-view 
points for a wider range of instances may be used to explore 
variations on smaller spatial scales until no further power is 
found. Additional PSFs are produced to explore wavelength 
dependence. The aim is to sample the instrument states and 
the spatial and spectral variation of the PSF at many points 
on the field of view in a representative fashion. 

(ii) Satellite pointing displacement time series predicted 
from attitude control system simulations are used to build 
up PSFs from the displaced optical PSFs integrated over the 
exposure. 

(iii) Charge spreading within the CCD detectors from 
charge diffusion measurements on representative CCDs is 
then added: this has now generated the system PSF for any 
particular pointing displacement time series. 

(iv) These sub-images are then pixelised onto the detector 
grid with the appropriate intensity distribution according 
to real data (or an appropriate Galactic model) and the 
instrumental throughput. 

(v) Galaxy images are produced externally to the simula- 
tions using galaxy models, or from real data from deep field 
observations, and then scaled and rotated individually and 
with a number distribution consistent with observed num- 
ber counts. Weak lensing shears are also added at this stage 
if required. Each image is convolved with the system PSF 
derived from the steps above, again taking account of the in- 
strument throughput. These images are also pixellised onto 
the detector grid. 

(vi) Internal and external background, cosmic rays, in- 
strumental (mostly dark noise) and Poisson noise and in- 
strumental cosmetic effects (hot and dead pixels, pixel-pixel 
non-uniformity) are modelled and added. Saturation ceilings 
are applied and the associated pixel bleeding calculated. 

(vii) CCD radiation damage models are then applied to 
reproduce the radiation damage effects, again using repre- 
sentative laboratory data from radiation testing to ensure 
their fidelity. Readout noise for the CCD readout node is 
finally added, together with the electronic bias level. 

(viii) Full-scale mosaics are produced using a simulated 
CCD metrology for the full array. 

The survey strategy may also require simulation to en- 
sure that the anticipated mission samples the sky adequately 
to allow the information about the galaxy shears to be re- 
covered to the required level of accuracy. 



3.2 Radiation Damage Effects 

While most of the procedures above are relatively standard, 
the modelling of the radiation damage effects in the CCDs 
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requires particular attention, as this is where most of the 
non-convolutive effects in the system arise. 

The radiation environment above the Earth's atmo- 
sphere will gradually degrade the performance of all elec- 
tronics. The principal impact for a CCD-based weak lensing 
experiment will be changes to image shapes as a result of 
radiation-induced lattice damage in the CCDs. This will in- 
troduce inefficiencies in the charge transfer during readout 
(Charge Transfer Inefficiency; CTI). As electrons are trans- 
ferred to an amplifier at the edge of the device, they can be 
temporarily captured by lattice defects (traps) and released 
only after a time delay (e.g. Holland et al., 1990). These 
electrons then appear in pixels subsequently read out, as a 
spurious trail behind the image in both the column (paral- 
lel) and row (serial) directions. The degradation is negligible 
in pixels adjacent to the readout node, because electrons un- 
dergo few transfers before being read out, and worst at the 
positions furthest from a node. The effect will modify the 
size and introduce a spurious elongation of galaxies (Massey 
et al., 2010), dependent on flux and its position on the CCD, 
directly modifying the cosmological weak lensing signal if 
not accounted for. CTI trailing is particularly troublesome 
because the trailed flux is a nonlinear function of the total 
flux (signal plus sky background), and of the size and shape 
of a source and therefore contributes as a non-convolutive 
effect. 

There are two main types of CTI models used in the 
simulations. In the first, the charge transfer process is mod- 
elled statistically in detail, including the interactions be- 
tween the charge cloud and the electric field structure within 
the pixels, by Monte Carlo techniques (e.g. Seabroke, Hol- 
land & Cropper, 2008; Prod'homme et al, 2011). These 
potentially offer the highest fidelity description of the ra- 
diation damage effects. However because the parameters in 
the model (mainly the capture and release times, and the 
capture cross sections) are determined by iterative fitting 
to laboratory data, these parameters are in practice not 
well constrained, as the models are computationally inten- 
sive. The other approach is to capture as well as possible in 
a simplified model the essential physical interactions while 
modelling the statistical effects: this enables rapid iterative 
parameter fitting at the cost of a reduced fidelity. Examples 
include the model developed for the Hubble Space Telescope 
Advanced Camera for Surveys (Massey et al., 2010) and 
CDM03 (Short et al., 2010) used in the Gaia programme, 
both of which have variants explicitly tailored for weak lens- 
ing surveys. Unfortunately, the approaches are currently suf- 
ficiently different that some of the parameters determined in 
the second approach are not yet directly useful for the Monte 
Carlo approach. 

The radiation environment to be experienced by the 
detectors is mainly parameterised by the mission duration, 
as well as the fluence and energy spectrum at the orbital 
location. All laboratory test data should be as representa- 
tive of the flight condition as possible. Because lattice dam- 
age effects are different at ambient and cold temperatures, 
irradiations of test devices should be made at operational 
temperature, and the devices should be maintained at that 
temperature for the subsequent characterisation: this is lo- 
gistically difficult. The main operational dependencies are 
the temperature and the CCD parallel and serial transfer 
rates, and these will be the main parameter space to be ex- 



plored in the testing. Both are affected by the shape of and 
voltage levels used in the actual waveforms to read out the 
device, and generally this will be explored beforehand and 
should be agreed and standardised for all further tests, to 
permit the inter-comparison of results. The results will also 
be different for different device types, because of the differ- 
ent physical pixel structure, manufacturing procedures and 
raw Si characteristics. In particular, different doping regimes 
will lead to different trap populations. 

Once the parameters are determined by fitting to labo- 
ratory test data, the models are used to include the radiation 
effects in the simulated images in step (vii) above. Generally, 
the test data improve as the programme proceeds, partly be- 
cause actual flight-design CCDs may not be initially avail- 
able. Consequently the simulations will evolve. 

3.3 Cosmic Rays 

The effect of high-energy ionizing particle fluxes (electrons, 
protons, and ions) on the detectors in terms of induced tran- 
sient tracks on the images must be included in the simula- 
tions because they effectively reduce survey area when they 
are excised, or the local exposure duration if there are multi- 
ple exposures of each field. There is also an allocation made 
to any systematic effects this may induce in Table [T] The 
fluxes can be simulated using Monte-Carlo codes such as the 
STARDUST code (RoUand et al, 2007). These are able to 
compute realistic samples of images and the statistical prop- 
erties of the induced particle tracks. They incorporate solu- 
tions of the diffusion equation and take into account the nu- 
clear reactions, the shield anisotropy description, the prop- 
agation of energetic electrons and the generation of delta 
electrons. 

The main input data are the information about the 
detector structure, the environment particle spectra and a 
thickness table describing the shield around the detector. 
Generally a particle travelling inside the detector is assumed 
to lose energy along its trajectory according to the contin- 
uous slowing down approximation or via the production of 
delta electrons or by nuclear reactions (for the protons and 
the ions). The number of deposited electron-hole pairs can 
be obtained by dividing the deposited energy by the energy 
necessary to create a thermalized electron-hole pair (3.6 eV 
in Si). Charges, primary or secondary deposited in depleted 
zones must be directly collected; otherwise, they are subject 
to diffusion. 



3.4 Normal processing steps 

The raw data from a weak lensing experiment is not used 
directly. A data processing sequence is carried out to reach 
the required data quality from which the galaxy shears can 
be measured. This incorporates external information (such 
as parameters from laboratory tests, or astrometric source 
parameters) and internal calibration data. The information 
can be used either directly, or through a model. This adds 
to the information content of the data, but very great care 
must be taken in the quality of the external information, and 
in the acquisition of the calibration data to ensure that these 
are taken in a representative manner. At the extreme level of 
accuracy required for the weak lensing measurements, this 
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incorporation procedure will always, at some level, introduce 
bias effects feeding into A and A4 , and this is reflected in the 
need for an allocation for incorrect values in Table [1] Other 
calibrations of, for example dense star fields, can potentially 
be used to examine PSF spatial variability on small scales, if 
these exposures can be considered sufficiently representative 
in order not to introduce spurious biases. Wherever possible, 
the calibration information should be from within the data 
frames themselves. 

The processing follows a process of electrical bias (zero 
light level) subtraction, correction for linearity, correction 
for CTI effects, flat-fielding, correction for detector cosmetic 
defects, cosmic ray subtraction and astrometric correction. 
Scattered light contributions may be removed and the back- 
ground modelled. Because the knowledge of the PSF is one 
of the most critical aspects in a weak lensing experiment, 
the steps requiring the most attention are those that impact 
the PSF. 

3.5 Correction for Radiation Damage 

The absolute density of charge traps will gradually increase 
during the mission as radiation damage accumulates. Labo- 
ratory tests indicate that there are different species of charge 
traps. While all of these species contribute to the total CTI, 
not all of the species equally degrade weak lensing measure- 
ments. Following Rhodes et ai, (2010), charge traps with 
a characteristic release time much shorter than the charge 
transfer speed at which rows and columns of pixels are read 
out, move electrons by at most one pixel, and hence affect as- 
trometry. Charge traps with long characteristic release times 
remove electrons entirely from a source, and degrade pho- 
tometry. As shown by Massey et ai, (2012b, in prep), charge 
traps with characteristic release times a few times the charge 
transfer speed move electrons from the core of an astrophys- 
ical source into its wings, and primarily affect its size and 
ellipticity, and these have the most negative effects on the 
weak lensing measurements. 

As noted above, one of the early steps in the data pro- 
cessing is to minimise as far as possible the radiation damage 
CTI effects in the data. This is a critical step, as the residuals 
from this process will be the largest non-convolutive knowl- 
edge contributor in Table [1] the effects modify the magni- 
tude, ellipticity and position of the object in a intensity- 
and size-dependent manner. In this respect, the CTI also 
creates non-linearity which is in addition to that caused by 
the detection chain (Section 14.4. 1|) . 

Ideally, the trailed electrons should be returned back to 
the pixel to which they belong. Fortunately, the trailing is 
typically a small perturbation around the true image, so an 
inverse operation can be achieved via a rapidly-converging 
iteration of the forward algorithm (Bristow & Alexov 2002, 
Massey et ai, 2010). This is done by taking the real data, 
passing them through the best model available for CTI ef- 
fects - generally those used in the simulations in Section[3]2] 
- to create double, and higher multiples of the damage. Lin- 
ear combinations of these images are subtracted from the 
original data to remove the effects to the required level. The 
procedure is shown in table 1 of Massey et al, (2010). 

The level of correction that is possible depends upon 
the accuracy of the CCD CTI model and the level of read- 
out noise. Readout noise places a fundamental, hardware 



limit on the correction accuracy because it is added to an 
image after the charge transfer, and is therefore not trailed 
(Anderson & Bedin 2010, Massey et ai, 2012b). This effect 
leads to correlated noise in the final corrected image. 

Note that during charge transfer, charge capture and re- 
lease are stochastic events, the exact location of each trailed 
electron cannot be accurately predicted. Individual galaxies 
on individual exposures may not be perfectly corrected, but 
statistical measurements of an ensemble galaxy population 
can be corrected to an unbiased level (Rhodes et ai, 2007, 
Massey et ai, 2012b). However, with advanced calibration 
techniques known as pocket-pumping in which an image is 
shifted forwards and backwards by a small number of pixels, 
the location and characteristics of each trap may be ascer- 
tained (Janesick 2001). 

4 MODELLING THE SYSTEM PSF 
4.1 Overview 

After the data processing, including the calibrations and the 
CTI correction, a set of optimised images containing stars 
and galaxies is available for each part of the sky. We are now 
in a position to calculate the shear maps to determine the 
cosmological parameters. 

In order to reach the accurate measurement of galaxy 
shape needed for reconstructing the shear maps, classical 
deconvolution approaches do not deal adequately with the 
effects of noise and finite sampling by the detector. There 
has been continuous progress over the past decade in the 
accuracy with which shear can be measured (for example 
Heymans et ai, 2006, Massey et ai, 2007, Bridle et ai, 
2010, Kitching et ai, 2010, 2012). In one approach, for each 
particular galaxy shape measurement, a galaxy model is con- 
structed from a combination of intensity profiles. These pro- 
files are convolved with the model PSF, and compared in a 
model fitting process to the observed galaxy, the true image 
of which has been convolved with the true (but not fully 
known) instrument PSF (for example Miller et ai, 2007, 
2012, Kitching et ai, 2008b). In another, the ellipticity is 
computed more directly using the quadrupole moments as 
in Equation [12] (for example Kaiser, Squires & Broadhurst 
1995, Luppino & Kaiser 1997, Hoekstra et ai, 1998). 

An allocation for the uncertainty in this model-fitting 
process is in the model bias knowledge {S{a))^ and {S{^)) 
lines of Table [T] and some of these aspects are discussed 
briefly further in Section 14.41 We continue in this section 
with the error arising from the fact that the instrument 
PSF is not fully known: calculated or estimated values are 
in the lines of Table [T] Stars provide measures of the PSF 
at different points on the held of view, which will enable 
the PSF for any particular galaxy to be calculated. Each 
exposure will also have been taken under slightly (perhaps 
minutely) different conditions, for example changed payload 
temperatures resulting in different optical alignments. The 
PSF model for each galaxy must be reconstructed from the 
stellar PSFs in the field of view, and the task is to model the 
PSF to reach a level of fidelity to the true PSF such that the 
biases in the shear measurements must be within the levels 
in Equation [5] for the cosmological goals to be met. 

As already related in respect of Table [TJ this mod- 
elling process has several categories. Some are related to 
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the amount of information that is available to reconstruct 
the PSF for any particular galaxy. For example, the spa- 
tial sampling must be adequate f Section I4.2.1|) . Another is 
the precision available in the calibrating PSFs simply from 
their photon shot noise. Other categories include the math- 
ematical form of the model used to characterise the PSF, 
the accuracy with which the coefficients of the model can 
be derived in order to construct any particular PSF and the 
accuracy with which calibrations can be transferred to the 
particular stellar PSF being modelled. 

We now examine the main categories of convolutive ef- 
fects, non-convolutive effects, and galajcy modelling in Ta- 
ble [T] in sequence. 

4.2 Convolutive effects in the PSF modelling 

4-2.1 Sampling Issues 

Assuming a central wavelength of A for the instrument 
bandpass and a primary mirror of diameter D, all images 
are fundamentally bandlimited at a spatial frequency of 
Umax = D/X even in the presence of spatial blurring in the 
remainder of the instrument. The telescope PSF can contain 
no modes at higher frequency than this value, and thus no 
higher frequency signal remains in images after convolution 
with this PSF. A band-limited image can fully be recovered 
as a continuous function, without loss of information or ac- 
curacy, using Sine function interpolation between a discrete 
set of samples provided these samples are spaced at a greater 
spatial frequency than the critical sampling rate, or Nyquist 
rate, 2Mmax. An output image must therefore be constructed 
at a resolution of < l/{2umax) radian per sample in order 
to be fully-sampled and allow full reconstruction of the sky. 

In general, for a given optical system, and number of 
pixels in the focal plane, there is a tradeoff between maximis- 
ing the survey area (more arcseconds per pixel) and max- 
imising the PSF sampling (fewer). If the system PSF is fully 
sampled by the detector pixel grid, then it is not degraded 
by the sampling. If on the other hand some compromise is 
made to enlarge the field of view, with the expectation that 
some spatial resolution can be regained through multiple ex- 
posures, which are often required for other reasons in any 
case, then an analysis of the effect of this undersampling 
is required. Further, while undersampling is an important 
consideration in the modelling of the PSF, aliasing also af- 
fects galaxy shape measurement for methods which do not 
directly fit parametric models to the data. 

We start by examining the effect of moderate under- 
sampling with 3 or 4 exposures, as such a sequence may be 
typical to recover gaps in the detector matrix. The full sys- 
tem PSF should be used to test the sampling characteristics 
and these sampled on the detector pixel grid, with successive 
exposures offset to mimic the multi-exposure survey strat- 
egy. We use an optical PSF based on the Euclid example 
with an input focal plane sampled at 0.688 of the Nyquist 
rate. We then convolved this PSF with an additional circular 
Gaussian of standard deviation 0.196/(2?imax), to approxi- 
mate the dispersive effects of charge diffusion within the 
CCD pixels. Finally, we add an additional jitter component 
to the combined PSF using a time series of 216 jitters of 
displacement 0.145/(2uinax) (Gaussian rms). 

We use the optimal linear image combination formal- 



ism of Rowe, Hirata & Rhodes (2011), sampling the output 
at the Nyquist rate. This formalism aims to minimize two 
contributions to imperfect image reconstruction: the leak- 
age objective Up and the output noise variance Epp. The 
former measures the fidelity of the output image to the 
target PSF: in this test a low value for Up indicates that 
unwanted changes to the PSF from the linear combination 
process have been small. To set an absolute tolerance on 
this quantity, it is useful to consider the normalized leakage 
objective Up/Cp, where Cp is a measure of the integrated 
PSF autocorrelation (see Rowe et al., 2011). A tolerance 
value for a normalized leakage objective of Up/Cp < 10~* 
approximately corresponds to controlling unwanted changes 
to the PSF to better than one part in lO** and ensures that 
such changes are a minimal contribution to the PSF uncer- 
tainty budget in Table [T] The output noise variance Epp is 
specified in units of the variance of noise in the input im- 
ages. An output Epp < 1 therefore demonstrates that the 
noise variance in the output pixels is reduced relative to the 
noise in the inputs, and this can be taken as an indication 
of stable control of noise in the reconstructed, fully-sampled 
output image. Following the methodology of Rowe et al., 
(2011), we can test to see whether the PSF and multiple- 
exposure strategies will allow linear combination of input 
images to generate output images that are unbiased at the 
Up/Cp < 10~* level while simultaneously keeping output 
noise to an acceptable level. 

In the upper panels of Figure [T] we show maps of nor- 
malized Up/Cp and Epp (given in units of the input noise 
variance) for a single realization of a four randomly-offset 
exposure system (dithers) in which the sampling is 0.688 of 
the Nyquist rate. This realization was one of 30 realizations 
tested, and results were typical. Up/Cp is found to be < 10~^ 
everywhere. The output maps shown come from the central 
regions of the input images, where data coverage is good and 
edge effects do not impair results. As discussed in Rowe et 
al., (2011), the effects in edge regions can be mitigated in 
real data by tessellating many small regions of reconstructed 
output such as those shown. 

For the ensemble of 30 realizations tested, the average 
Up/Cp in the reconstructed output was 9.95 X 10~^ in the 
very centre of the specified tolerance range, demonstrating 
a desired level of control over unwanted distortions in the 
output image. The average noise variance Epp in the recon- 
structed output was 0.663 (in units of the input variance). 
Because there is no background in the images, this vari- 
ance will be an upper limit with respect to real sky expo- 
sures. This demonstrates that for a PSF sampled at 0.688 of 
Nyquist, with four input exposures, a linear combination of 
images can be used to generate fully-sampled output while 
maintaining acceptable levels of noise in the output. 

The lower panels of Figure [T] show the normalized 
Up/Cp and Epp for a single realization of a three expo- 
sure pattern. Here the optimal linear combination produces 
Up/Cp > 10~* at some points in the output image: these 
can be seen as red squares. It can also be seen that that 
output noise variance for this reconstruction is significantly 
greater than was the case for four input exposures. 

As for the four exposure case, a total of 30 realizations of 
the three exposure scenario were investigated. The average 
Up/Cp in the reconstructed output pixels across this ensem- 
ble was 1.0583 x 10~*, slightly larger than the desired max- 
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Figure 1. Maps of normalised output leakage objective Up/Cp (left eolumn) and noise variance Spp (relative to a unit input variance; right 
column) after linear image combination from single realizations of a four (upper row) and three (lower row) random dither configuration. 
These output maps are taken from the central regions of large images, showing the central regions of primary interest (see Rowe et at, 
2011). 



imum for reconstruction, and the average Epp was 1.4405 
(in units of the input variance). The results from a single 
realization shown in Figure [T] are typical of these tests, but 
the variation in reconstruction quality between realisations 
was noticeably greater than in the four exposure case. 

In Figure[2]we plot histograms showing the distribution 
of Up/Cp and Epp output pixel values for the full ensemble 
of 30 realizations of the three exposure random dither pat- 
tern. As each output region of the type shown in Figure 
[T] consists of 36 x 36 output pixels, there are 38880 total 
output pixel locations making up the full sample for each 
of these histograms. We also provide some statistics of the 
distributions, showing that in the three exposure case, only 
~ 18% of the reconstructed output had Up/Cp > 10^*, but 
nowhere did this quantity exceed 3 x 10^**. Nearly two thirds 
of the output pixels have a noise variance smaller than the 



noise variance on input pixels, and for less than 10% of the 
output is the variance greater than a factor of three times 
the input. 

Because both Up/Cp and Epp are squared metrics of the 
quality of reconstruction (Rowe et ai, 2011), these results 
suggest that while the three exposure case does not meet 
stated requirements in this 0.688 Hyquist-sampled case, it 
comes close. Figure [1] shows that this failure to meet the 
tolerance is spread regularly over the survey regions. It is 
not clear to what extent this regularity, and the failure 
to meet Up/Cp will effect weak lensing measurements for 
this 800nm PSF. We also note that for shorter wavelengths 
within the bandpass the Nyquist frequency is correspond- 
ingly increased, exacerbating the situation. On the other 
hand, the analysis in Section 14.31 (where the three exposure 
case including CTI is propagated into the shear power spec- 
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Figure 2. Histograms of normalized leakage objective Up/Cp 
(left panel) and noise variance Spp (in units of input variance; 
right panel) for a total of 38880 output pixels from 30 realiza- 
tions of the three input exposure scenario. 



trum) indicates that such a variation has a hmited impact 
on the Dark Energy FoM. 



4.2.2 Construction of the PSF 

Because the PSF wiU be derived from the stars in the field 
of view surrounding each galaxy, a fundamental limit on 
the fidelity of this model is set by the photon statistical 
error in each pixel containing the PSF. If there are sufRcient 
bright stars in the field, then the form of the PSFs in each 
exposure can be modelled to the necessary level of fidelity, 
exposure by exposure: in their analysis, Paulin-Henriksson et 
al, (2008, 2009) found that about 50 stellar PSFs at a signal- 
to-noise ratio of 500 were sufficient to determine the PSF to 
the accuracy required for the particular PSF that they were 
using. The surface density of stars which both are not nearly 
saturated (i > 18.3) and which would have signal-to-noise 
ratio greater than 500 is expected to be about 950 deg~^ 
in a survey such as Euclid , for fields with Galactic latitude 
b ~ 30° near the North Ecliptic Pole. If the system is stable 
between slews, then measurements of stars from successive 
fields may be combined appropriately to improve the PSF 
model. This approach is shown in Figure [3] However, this 
stability is not strictly necessary, if the modelling can take 
into account the variation of the PSF with time, capturing 
all of the possible states of the system. The PSF can then be 
reconstructed for a particular position on the field of view, 
and for a particular instrument state. 

The instrument state is defined through a (large) num- 
ber of parameters, for example mirror separations and align- 
ments in the optical train. Hence the "time" dimension in 
Figure 13] can be replaced with a separate dimension for each 



Images 
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yi 



3D model of PSF generaled. 
A function of space and lime 



Figure 3. The PSF is expected to vary as a function of spa- 
tial position in the field of view and with time, resulting from 
changes in the instrument state. The instrument state may be 
that, for example, characterised by the primary-secondary mirror 
separation, or the temperature difference between certain opti- 
cal elements. These individual contributions may substituted for 
the 'time' column. An additional parameter will be the effective 
wavelength of the light producing the PSF. 



parameter. Some of these (such as the primary-secondary 
mirror separation in the telescope, leading to focus changes, 
examined by Ma et al., 2008) will, however, be dominant, 
and not all physical changes will induce ellipticity, so in prac- 
tice the additional dimensionality should be constrained. 
The additional dimensionality beyond the 2 dimensions of 
the focal plane reduces the accuracy with which the PSF 
can be constructed, but with typically > 10® suitable stars 
in a long survey duration, even a large number of additional 
dimensions can be accommodated. Then in principle all of 
the exposures in the survey can be used, and a multitude of 
PSFs will be available to construct the PSF for any galaxy. 



4-2.3 Principal Component Analysis of the PSF 

Any PSF can be modelled through a combination of func- 
tional forms. Which functional forms are optimal will de- 
pend on the criteria by which this is assessed. One simple 
criterion may be that each of the components making up the 
PSF should be orthogonal; another may be that a minimal 
set should be used, requiring that the series of components 
should converge rapidly. 

Principal Components Analysis (PCA) is a general sta- 
tistical method that enables variation in data to be iden- 
tified in a way that makes minimal assumptions about the 
nature of the underlying variation. More formally, the PCA 
methodology is a mathematical procedure that uses orthog- 
onal transformations to convert a set of correlated variables 
into a set of uncorrelated variables called principal compo- 
nents. PCA also determines the coefficients which describe 
how much of each component should be used. 

PCA makes the assumption that modes of variation are 
additive. This may be restrictive when changes in PSF result 
from, for example, focus variation, so other, more physically- 
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Figure 4. Components (left) and residuals (right) depending on number of components used for modelling a PSF (top). The intensity 
scales are logarithmic, with a colour table which enhances lower image levels. 



described models, such as those directly coupled to the op- 
tical modes (e.g. Schechter & Sobel Levinson, 2011) may be 
more efficient. 

In applying PCA we may consider the input data to 
be the PSFs provided by stars, and the input (correlated) 
variables to be the position of the PSF in the field of view, 
the spectral energy distribution (SED) of the photons in 
the bandpass and the parameters describing the instrument 
state (such as the focus). Each component of the PCA basis 
set derived from the PSF is an image, and the components 
together generate an orthogonal set of 2-dimensional im- 
ages. This is illustrated in Figure |4l As there is a coefficient 
for each component to instruct how much of that coefficient 
should be used in the construction of a PSF, the coefficients 
are vector functions, with length corresponding to the num- 
bers of PCA components. The dependencies in the derived 
component functions are the positions in the focal plane, the 
SED and the instrument state. 

PCA PSF reconstruction has been successfully imple- 
mented on space-based weak lensing data from the Hub- 
ble Space Telescope (Jee et ai, 2007) (see also Rhodes et 
al, 2007, Schrabback et al, 2007, 2010 who used PCA to 
characterise the variation of the two-component ellipticity, 
rather than the PSF pixel values). We could use the stellar 
(noisy, pixellised) images themselves to generate the PCA 
components and coefficients. As described in Section [4.2.11 
the problem with undersampled data is that Fourier modes 
above the Nyquist sampling limit are not only lost, but are 
aliased to lower frequencies, resulting in corruption of all 
Fourier modes. The apparent shape of the PSF depends on 
the sub-pixel location with respect to the detector pixel grid, 
and no linear interpolation scheme can allow us to predict 
the PSF at one location, given its form at another. The effect 
of undersampling is to corrupt the PCA component calcu- 



lation to make them no longer orthogonal, and the presence 
of noise results in spuriously high coefficients, particularly 
at higher-order eigenmodes. While this may be mitigated 
by the use of multiple, dithered exposures, in the presence 
of noise, it may not be possible to make a unique, method- 
independent reconstruction of a fully-sampled PSF. 

One approach to estimating the high-frequency modes, 
beyond the sampling limit of an individual observation, is 
to create a super-resolution model of the PSF, fitted to the 
data. We illustrate this approach here by creating basis-set 
models based on simulated super-sampled PSFs as a func- 
tion of focal plane position and SED for each state of the 
instrument. PSF variation is created using a Monte Carlo 
approach to vary the instrument's optical characteristics. 
The eigenmodes of those PSF variations are found, and the 
coefficients for this basis set are calculated by fitting to sim- 
ulated observations. This process is illustrated schematically 
in Figure [S] In this way we can test the accuracy to which 
a super-resolution PSF may be reconstructed provided we 
have accurate PSF models: such a test investigates the in- 
formation limit of the data, but does not probe our ability 
to generate accurate models. 

In Miller et al., 2013, the fitting procedure is treated as 
a Bayesian estimation problem, in which we use our prior 
information about the statistical distribution of the eigen- 
modes from the simulations, together with measurements of 
the likelihood of the models fitted to the data, to obtain the 
statistically most-likely PSF reconstructions. Such a proce- 
dure has the advantages that it makes full use of the avail- 
able information about the system; it places the problem in 
a rigorous statistical framework; being a forward-modelling 
process, we may include all effects that we believe are present 
in the real system; and the Bayesian approach prevents over- 
fitting of noise. In the case that the model PSFs are too far 
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Figure 5. Procedure for using simulated PSFs to generate the 
basis set of 2-dimensional images which are fitted to the simulated 
image data. In our simulations, one of the optical PSFs not used 
for the basis set construction is used to generate 144 realisations 
of realistic stars, including realistic levels of noise, AOCS and 
detector response. 



from the actual ones in orbit, we expect that the model basis 
set may be updated in-orbit as more information becomes 
available from star measurements. 



4-2.4 Characteristics of the Basis Set 

Before we perform the Bayesian fitting to the noisy pixel- 
lated data, we first examine the characteristics of the eigen- 
mode basis set, and investigate how many components may 
be required to adequately model the PSF. In detail, the re- 
sults of this analysis will depend on the nature of the optical 
system, the pointing performance, and the detector charac- 
teristics, but the procedure would be similar for any realistic 
system. 

To test this we first generate end-to-end simulations 
as described in Section |3] above, using the Euclid case. In 
summary, we first determined the optical system behaviour 
from simulations of the optics, with variations imposed on 
the optical system over agreed ranges, and then with the 
simulated system being perturbed as expected in orbit by 
convolving the optics PSF with the charge spread within 
the detector and with a kernel arising from guiding errors. 
The PSFs are oversampled by a factor 12 compared with the 
Euclid visible detector sampling. There are no noise sources 
in this test. 

After this we measure the ellipticity of the PSF using 
Equation [To] with a wide Gaussian weighting function with 
a{w) = 4 times the FWHM. We first examine the required 
number of components needed to model the spatial varia- 
tions of the PSF accurately over the full field-of-view for a 
monochromatic PSF at 800nm (Figure [6|. For the EucUd 
case, we find that ~ 18 components are enough to describe 
the PSF spatial variations with sufficient accuracy. This will 
be similar for other systems in practice. We now examine 
the number of components to encompass the variations in 
the opto-mechanical system post-launch and in the space 
environment, for a single field point (derived from a Monte 
Carlo tolerancing analysis) and again find that ~ 20 compo- 



nents are adequate. If we combine both, the number of com- 
ponents required to model correctly both spatial variations 
of the PSF and variations corresponding to the instrument 
state rises to ~ 38. 



4-2.5 PSF Wavelength Dependence 

So far the PSF we have been modelling is monochromatic. In 
reality, the PSF will be different, depending on the spectrum 
of the star generating that PSF multiplied by the instru- 
ment end-to-end throughput as a function of wavelength. 
The largest contributor to this effect is the diffraction in 
the optical system, which increases linearly with wavelength. 
This is generally counteracted slightly by the inverse wave- 
length dependence of the charge spread in the CCDs be- 
cause photons of redder wavelengths travel deeper into the 
pixel and closer to the electrode structure before they are 
absorbed. The other contributions (the attitude control sys- 
tem pointing variation and the radiation damage effects) do 
not have any wavelength dependence. 

We have examined the number of principal components 
that will be required to model a multi-wavelength PSF. In 
a new analysis, we have added the wavelength dependence 
effect by considering a set of monochromatic PSFs at 550 
nm and 800nm. Because the size of the core of the PSF 
changes approximately linearly with the wavelength, this af- 
fects significantly the ability of the eigenmodes to represent 
the wavelength dependence. To reduce the number of com- 
ponents, a spatial rescaling by a factor 800/550 of the PSFs 
at 550nm has been applied. With this simple measure the 
number of components required to model the PSF correctly 
including the wavelength dependence effect, the spatial vari- 
ations and the telescope stability, rises to ~ 70. This result 
is shown in the right panel of Figure [S] 

This analysis suggests that the eigenmode approach in- 
deed enables the full range of PSF to be modelled in a rep- 
resentative fashion, albeit at the price of potentially needing 
a large number of eigenmodes in the analysis. However, this 
analysis does not take into account the relative importance 
of the modes at 550 nm and 800 nm in actual data: for re- 
alistic spectra, the long-wavelength parts of the spectrum 
dominate the PSF, and thus the actual modes needed in 
practice may be fewer than would be implied by Figure [§] 



4-2.6 Bayesian Model-fitting 

Having explored the approximate number of eigenmodes 
that may be required to construct the PSF, we now examine 
whether the Bayesian approach discussed above can provide 
sufficient information on the coefficients of this component 
set for reconstructing the PSF in realistic simulations, to 
meet the allocations in Table [T] For this purpose a conser- 
vative assumption is to limit the amount of temporal stabil- 
ity required and analyse each set of exposures of a region of 
sky independently of any other field. The aim is to investi- 
gate the extent to which the underlying, fully-sampled PSF 
may be reconstructed from noisy data in a single field. We 
do, however, assume that each field is observed with three 
dithered exposures, and that the PSF is invariant during 
those dithered exposures. 

The first step of this reconstruction is to define the set of 
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Figure 6. (Left) The difference between the known ellipticity and the modelled one as a function of the number of principal components 
used for the modelling. Diamonds represent the standard deviation of the residual error considering the spatial variations of the PSF 
over the field of view, triangles represent the standard deviation of the residual error considering the variations of the PSF arising from 
variations in the instrumental state, and squares represent the standard deviation of the residual error considering both together. (Right) 
As, for the left hand panel, but with the diamonds representing the standard deviation of the residual error considering simultaneously 
the wavelength dependence together with all of the variations of the PSF over the field of view and the stability of the telescope. The 
dashed line indicates the value assigned in Table ^ 



basis model components that characterize the system using 
normal mode decomposition as described in earlier sections. 
We then fit these components to noisy realisations of stars. 
The star profiles are taken in turn from the set of model 
PSFs, but excluding that profile from the determination of 
the components of the models above. As before, the PSF 
used in this analysis is the Euclid system PSF taking into 
account the opto-mechanical, detector and attitude control 
system pointing variation contributions. Provided the in- 
formation on the pointing variation is telemetered by the 
spacecraft, the effect of this uncertainty on the PSF may 
be corrected, to a certain level of accuracy. On the other 
hand, we could proceed without this information, and then 
the guiding errors would need to be included as additional 
fit parameters. For this test, it is assumed that the CTI has 
been fully corrected in prior data processing (e.g. Massey et 
al, 2010): the efficacy of this is described in Section [4.31 

The simulation uses the Besangon model of the Milky 
Way (Robin et al, 2003) to predict the number- magnitude 
relation of stars at the North Ecliptic Pole in the CFHT- 
system i bandQ There are 3.5 stars arcmin^^ in the range 
18 < i < 23, and 6300 stars in a half square degree, cor- 
responding to the Euclid full field of view. The Besangon 
model also allows the creation of a simulated catalogue of 
stars with magnitude and spectral type, and to create the 
simulated Euclid observations, stars were randomly selected 
from that catalogue. As the stellar PSFs used in the PSF 
modelling are all moderate or high signal-to-noise ratio, their 
colours will often be known from catalogs, such as that which 
ESA's Gaia mission will produce. Here, we assume that their 
optical and near-infrared magnitudes can be measured from 
the Euclid mission data alone. To model the PSF in the 



presence of the varying colours of stars, a simple model was 
assumed for the PSF wavelength dependence, in which the 
optics component alone was assumed to scale in angular 
scale linearly with wavelength, with respect to simulated 
PSFs calculated for wavelength 800 nm. While this model 
is an oversimplification of the true wavelength dependence, 
it serves to capture the basic effect and allows us to test 
whether, in principle, the PSF could be reconstructed at 
the required level of accuracy. Simulated stars were cre- 
ated by dividing the EucUd visible instrument passband into 
small wavelength intervals, evaluating the expected num- 
ber of detected photoelectrons in each wavelength inter- 
val, given the SED of each simulated star, and coadding 
the wavelength-stretched PSFs across the bandpass with 
wavelength-dependent weight given by that number of pho- 
toelectrons. Star SEDs were obtain from the "UVK" library 
of Pickles (1998)0 In the measurement /fitting test, stars 
were assumed to have noisy photometric measurements, 
from which an estimated SED was evaluated using the same 
stellar library (i.e. assuming that star SEDs may be obtained 
from broad- band photometry without systematic error), and 
PSF eigenmodes were adapted to the SED of each star using 
the same SED-weighting procedure that was used to create 
the simulated observations. The model PSFs and simulated 
stars are not expected to match exactly because of the intro- 
duction of photon shot noise in the simulated stars. It was 
further assumed that 30 percent of stars might not be mea- 
sureable owing to the effects of confusion with faint galaxies 
and image artefacts. 

In this test some account was taken also of the varia- 
tion in PSF across the Euclid field. The field was divided 
into five zones of equal area, and PSF models were calcu- 
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lated at five locations (the four corners and the centre of 
the field). When the PSF eigenmodes were created, pixels 
were included from all five PSF models, so that the normal 
mode analysis generated position-dependent modes, albeit 
sampled only at five locations. The PSF was assumed to 
be spatially invariant within each zone: a more advanced 
method should allow interpolation as a function of position 
in the field. 

Information from three dithered simulated exposures 
was used by jointly fitting the PSF models to all three ex- 
posures. In this test, the absolute positions of stars were 
assumed to be unknown, and were marginalised over in the 
fitting, but the relative positions of stars on each of the three 
exposures was assumed to be fixed and known. In practice, 
the relative registration of multiple exposures should be de- 
terminable to very high accuracy from joint analysis of all 
the stars in the field. By assuming the star positions to be 
unknown, we are discarding potentially useful information 
on the field distortion, which also provides information on 
the optical path. In practice, accurate absolute star posi- 
tions may be available from the Gaia mission data, which 
could also be included in the analysis. 

Further description and evaluation of the above proce- 
dure will be provided by Miller et at, (2013). In this ini- 
tial evaluation, forty modes were found to capture the PSF 
variation at an adequate level. This number is fewer than 
expected from Figure [6] because of the weighting of the con- 
tributions from differing wavelengths by the SED photon 
counts. In line with the formalism established above, the 
PSF reconstruction is evaluated by the statistics a[R^y 
and cr^fe^] of the differences between the input PSF image 
and the reconstructed PSF, both quantities being measured 
from the image-weighted second moments. The results are 
shown in Figure [7] In this preliminary evaluation, a sys- 
tematic offset was found in the value of ellipticity in some 
parts of the field, which does not appear in the rms statis- 
tics shown in Figure [T] While, in a full PSF modelling sys- 
tem, such a systematic would need to be eliminated, the 
exercise presented here nonetheless shows that, in princi- 
ple, sufficient information exists in simulated observations 
of realistic stars fields to allow accurate PSF reconstruction 
to the levels assigned in Table [T] For the full-field simula- 
tion of 4400 used stars, the uncertainty a[R^\/R^ in size is 
< 1.5 X 10"* and that in ellipticity, o-^[|eicl] < 1-0 x 10"" 
per ellipticity component. The allocations aggregated from 
several contributions in Table[l]are cr[i?^]/i?^ < 4.8 x 10"'* 
and o-'^[|ei(,|] < 1.5 x 10~* per ellipticity component. 

The results presented here show that it is possible in 
principle to reconstruct the PSF to sufficient accuracy to 
meet the science requirements set by Equation [5] and organ- 
ised in Table [1] by normal mode model-fitting to observa- 
tions of stars in single fields with a small number (three) 
of dithered exposures, even if no longer-timescale temporal 
information is used. The requirements and values in Table [1] 
will be different for different experiments, so this will need 
to be evaluated on a case-by-case basis. Should additional 
margin be required, the temporal information could be ex- 
ploited. 



4.2.7 Flat Fields 

Because of manufacturing tolerances, all CCDs are subject 
to slight variations in their pixel-to-pixel sensitivity. This 
is called photo-response non-uniformity (PRNU). This is at 
least partially caused by differences in pixel size in the pho- 
tolithographic mask sets used to manufacture the CCD, but 
the PRNU can also show colour dependence, which indicates 
that other effects also contribute. 

At low signal levels the pixel-pixel variations on typ- 
ical CCD exposures are dominated by the readout noise. 
At higher signal levels the Poisson noise, increasing as y/N 
(where A'^ is the number of counts in the pixel) dominates. 
At even higher levels the PRNU, increasing as A'^, starts to 
dominate, even though the intrinsic PRNU is typically only 
2%. Generally, stars (which fall on several CCD pixels) will 
be used to calibrate the PSF and many of these stars will 
be close to the saturation level, where PRNU dominates. 

Typically, an on-board flat fleld calibration source is 
used to provide an even illumination over the CCDs in order 
to measure this PRNU. By accumulating several flat field ex- 
posures, a high signal-to-noise ratio normalised PRNU map 
can be accumulated. Dividing science exposures by this map 
- flat-fielding - can almost eliminate the PRNU degradation. 
It is essential, however, to achieve accuracies in the accumu- 
lated fiat-field which are sufficient, or else the flat-fielding 
can instead add noise to the image. An approximate way to 
minimise the effect of the flat fleld calibration on A and M 
is to ensure it is smaller by a factor C, compared to the Pois- 
sion noise, the effect of which which has been calculated by 
Paulin-Henriksson et al., (2008). If a number npsp bright 
stellar PSFs are used, and we assume the brightest pixels 
in these PSFs are flUed to the same level as that provided 
by the flat-field illumination, then the number of fiat fleld 
exposures Uf = CxA^psF (here we have used the fact that 
the fiat field under each stellar PSF is different). For C = 3, 
npsF = 50 (Paulin-Henriksson et al., 2008) then ni ~ 20 
flat field exposures are required to be combined. In prac- 
tice the frequency of fiat field exposures in order to achieve 
this number (with consequential operational overheads) will 
be driven by the timescale of the temporal changes in the 
PRNU, which is still unknown at this level. 



4.2.8 Pointing Accuracy Issues 

The imperfect operation of the satellite's attitude control 
system contributes to the PSF of an exposure because the 
telescope axis is not perfectly stable, and the combination of 
pitch, yaw and roll leads to field-dependent displacement of 
the images. In the analysis in Section r4.2.6l we assumed that 
these pointing displacements are provided by the spacecraft. 
These will not be noise free, but provided prior information 
for the normal mode analysis which was used in the forward 
modelling. 

Even in the absence of this information, Ma et al, 
(2008) have shown that when the pointing variation is much 
smaller than the width of the Airy disk of the optics PSF, 
its effect on the observed PSF is described by the mean dis- 
placement and the covariance matrix of the displacements. 
In this case the detailed pointing variation history is not 
important, and in principle only two stars are needed to de- 
scribe this contribution anywhere in the fleld of view. In the 



© 0000 RAS, MNRAS 000, 000-000 



Defining a Weak Lensing Experiment 19 





Figure 7. Variation in tlie residuals in tlie knowledge of tlie PSF in terms of size cfi?^] (left panel) and a single component of the 
convolutive ellipticity (T^[eij^] (right panel) as a function of a range in PSF size and ellipticity, derived using the normal mode modelling 
of 4400 stars. The square and diamond symbols correspond to the two ellipticity components. The dotted line in the right panel is the 
level in Tabled below which the ellipticity requirements for a single ellipticity component allocated to this contribution is met. The 
equivalent level for the size in the left panel is above the top of the plot, indicating that the knowledge uncertainties for this contribution 
are easily met. 
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Figure 8. The end-of-mission residual ellipticity |ejvcl in ^ single 
component induced by CTI effects in the detector system after 
correction during image processing for the worse case configura- 
tion: faintest and smallest galaxies located furthest from the read- 
out node. The degree of correction is limited by readout noise, and 
the required level is met for a readout noise of < 4.8e~ . The cor- 
rection assumes a correct model of the CCD charge transfer from 
pixel-to-pixel during readout. 

presence of noise more stars are needed, but in the typical 
case as discussed above, all ~ 4000 stars in the field of view 
can be used in the normal mode analysis to determine the 
pointing variation contribution for each exposure, requiring 
only a few additional components. If the pointing variation 
amplitudes are comparable or larger than the optics PSF, 
the mean and covariance of the displacements is not suffi- 
cient and the pointing history is required. 

4.3 Residuals in the Correction for Radiation 
Damage 

Having estabUshed the major convolutive effects, we now 
consider the extent to which the non-linear CTI effects 
caused by radiation damage can be corrected in the data 
processing described in Section [3.51 



Figure |8] shows the residual ellipticity in the galaxy im- 
ages after the image post-processing. This is plotted as a 
function of the readout noise of the detection chain on the 
abscissa, because readout noise is an important limiting fac- 
tor, as discussed in Section [3.51 The actual value on the or- 
dinate will depend on a number of parameters, for example 
the CCD characteristics, the fluence received by the CCD, 
background levels, signal-to-noise ratios etc. These will all 
be inputs to the simulations discussed in Section [3.21 Here 
we assume a five year mission in a deep orbit, four of which 
are at Solar maximum. At the end of mission, taking account 
of a nominal focal plane shielding and some margin, this ac- 
cumulates to a fiuence of 6 x 10^ protons cm~^ (scaled to 
the effects of protons of energy 10 MeV) a typical value for 
such a mission. The example uses small (minimally sampled) 
galaxies with a signal-to-noise ratio of 10, located far from 
the readout node (requiring ~ 2000 transfers). A readout 
noise < 5 electrons (which is generally reachable with care- 
ful design in a CCD detector matrix, as long as the readout 
speed is not too high) enables a residual ellipticity knowl- 
edge of cr[|eNcl] < 10"*. Because this analysis addresses the 
performance for the faintest galaxies in the worst position 
for CTI effects, this knowledge error will be smaller if the 
population ensemble of all galaxies that will be studied for 
the weak lensing is substituted and if they are placed ran- 
domly with respect to the readout node, and, further, if the 
average value of the radiation damage is used, rather than 
the end-of-mission level. 

In this case, the CTI has been generated by a model in 
the simulations, and corrected using the same model: hence 
this conclusion is reached using a perfect CTI model. Cur- 
rently, the best that has been achieved in practice is a fac- 
tor 20 reduction by Massey et al, (2010). In addition to the 
readout noise, the efficacy of image-level CTI mitigation will 
therefore rely on the accuracy of the CTI model compared to 
the solid-state physics taking place in the real instrument, 
both in terms of the fidelity of the model, and the accu- 
racy of the parameters used within it: these necessitate a 
substantial characterisation programme for the CCDs. The 
parameters include the trap density and the release times of 
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Figure 9. (left) The effect of imperfect CTI correction on tfic power spectrum Cij(£), multiplied by i'^ to show equal power for 
different spatial scales /. Here we use the auto-correlation power spectrum (i = j) and a mean redshift of the tomographic bin of = 1. 
We limit the power to £ < 5000 as described in Laureijs et al., (2011). The lines indicate progressive levels of CTI correction where 
i5ecTl = 2;<5cTl, uncorrected E'nd we show 10 values of X logarithmically spaced between 1 and 100. The impact is reduced as the correction 
is improved and the effect is limited to spatial scales of the CCD and smaller (corresponding to I > 2000 in the Euclid case), (right) The 
absolute difference (the difference changes sign at £ ~ 1200) in the C(t) power spectrum when the Dark Energy parameter is changed from 
wq = —1.0 to wq = —0.95 (blue): this represents the cosmological sensitivity of the C{1). The dotted lines show the absolute difference 
beween an unaffected C{t), with no CTI and wq = — 1.0 and a power spectrum affected by CTI with various levels of correction (dotted 
lines). 



each trap species. This is explored more fully in Massey et 
al., (2012b). In orbit, the parameters could be determined 
from fits to injected charge lines (during calibration expo- 
sures), to cosmic ray events, by the use of pocket pumping 
and perhaps by direct analysis of stellar PSFs. The algo- 
rithms to do this, used at the data processing stage, require 
careful development. 

While the requirements on the modelling and the deter- 
mination of the parameters for it turn out to be challenging 
with respect to the allowed values of A and M in Equa- 
tion [5] any residual shear effects in the detector coordinate 
system which are related to position with respect to the 
readout nodes (as opposed to those in the sky coordinate 
system) will be identified as inadequacies in the CTI model, 
and can potentially be iteratively nulled to negligible levels. 
The readout noise floor in Figure |5] is therefore an impor- 
tant parameter. While any final small residual errors in the 
CTI correction will not be fully convolutive, in that they are 
magnitude and background dependent, the contributions to 
the error that are linear could be incorporated with the stan- 
dard PSF modelling discussed above, and modelled out. 

Moreover, the CTI residuals will not contribute to errors 
in the derived power spectrum on all spatial scales, but will 
be limited to scales of the CCD and smaller (if the CCD is 
divided into sectors with separate readouts) . We have quan- 
tified this by analysing the effect on the power spectrum, and 
on the FoM. This analysis includes the radiation damage ef- 
fects assumed in the modelling above, including a fluence 
of 6 X 10^ protons cm~^, but with the galaxies (all again 
small and minimally sampled and with a signal-to-noise ra- 
tio of 10) now placed randomly with respect to the readout 
nodes. We use three slightly displaced exposures as in the 
Euclid pattern of ~ 100 arc sec (1000 pixels) in the parallel 
direction, one of which also has a 50 arc sec displacement 
(500 pixels) in the serial direction. The effect of a progres- 
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Figure 10. The reduction in figure of merit (FoM) with respect 
to perfect correction for the progressive levels of correction. The 
blue vertical line with <5£CTI = 0.0005 indicates the impact on 
the FoM achieved with HST data. 



sively improved correction on the power spectrum is shown 
in Figure We also show in Figure [TD] the change in the 
lensing-only FoM using the same parameters as those used 
in MHK12 for the systematic evaluations (and the same as 
those used in Laureijs et al., 2011). At all scales the differ- 
ence between the corrected power and the unaffected power 
is less that the difference in the power induced by a change 
in the Dark Energy parameter uiq = —1.0 to uiq = —0.95 
(start of Section I2.2|l . This encouragingly indicates that a 
CTI-corrected power spectrum at this level will have lim- 
ited effect on the Dark Energy measurements. 
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4.4 Model Transfer-to-Object and Model Bias 
Knowledge Errors 

4-4-^ Linearity 

The PSF model is inferred from the images of stars that are 
much brighter than the faint galaxies used in the weak lens- 
ing analysis. If tlie response of tlie detector is independent 
of flux then the model can be applied directly. Real detec- 
tors, and their associated external electronics, however, will 
have a non-linear response. This will lead to systematic er- 
rors resulting from changes in the shape of the PSF, which 
will either be more or less peaked than it should be. Note 
that this concern regarding the non-linearity is particular 
to the field of weak lensing measurements. The more usual 
concern with its effect on the overall photometric accuracy 
of the measurement may also be important, for example in 
determining the SED of the star being used for modelling 
the PSF. 

There are two main effects which lead to the nonlinear- 
ity. The first is the classic effect of non-linearity and satura- 
tion in the detector and external electronics. This is typically 
limited to ~ few percent. It can be addressed by calibrat- 
ing the detector using multiple exposures of the same field 
with different exposure times. In practice this may place 
tight constraints on the repeatability of any shutter or read- 
out mechanism. The main difficulty of these measurements 
is to obtain sufficient faint stars in order to calibrate the 
linearity at lower signal levels. After calibration, residual 
non-linearities from these effects can be constrained to ex- 
tremely low levels, especially as they are expected to vary 
only slowly with time, depending on operating voltage levels 
within the electronics. 

The second effect arises from the CTI in the detectors 
caused by radiation damage. Those traps with long release 
times remove photoelectrons from the PSF entirely. If the 
population density of traps encountered by the charge cloud 
as it is transferred through the CCD does not increase lin- 
early with signal level, this effect can induce non-linearities 
in the relationship between the optical flux level and the 
charge arriving at the readout node of the CCD. The effect 
also depends on the level of background light, mostly Zo- 
diacal, which has ameliorating effects particularly for these 
slow traps, and on the prior history of PSFs read out ahead 
of the PSF of interest. The effect can be corrected using 
the same model and technique used to correct for the faster 
traps (Section 13. 5|) . and again the residuals will be identi- 
fied from their amplitude in the detector reference frames. 
This will be a monotonically increasing effect as the mission 
progresses. 

The residual of the linearity correction is included in 
different categories in Table [1] mostly in the PSF modelling 
and in the transfer of model to object (because of the use 
of bright star PSFs to calibrate faint galaxy PSFs). These 
residuals are assumed to be quasi-linear. CTI-induced non- 
linearity can be incorporated within the non-convolutive cat- 
egory. 



4-4-2 Bandpass and Out-of-Band Transmission 

The wavelength dependence of the PSF can be determined 
from the data by comparing to stars that cover a range in 
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Figure 11. The relative change in (top panels) and (bot- 
tom panels) as a function of the level of out-of-band transmission 
for a flat-spectrum source for the Euclid reference. The left pan- 
els show the impact without any off-band information, while the 
right panels show the impact if the off-band information is used 
to in the PSF modelling. 

colour for wavelengths where the transmission is high. Out- 
side the nominal band, where the tranmission is low, no 
information can be recovered. 

The flux that is transmitted out-of-band contributes to 
the galaxy PSF, which is a concern. Ideally, in Eauations ll6l 
and [13 T(A) = 1 in-band and T(A) = out-of-band. The 
transition has a flnite width, and the level of out-of-band 
transmission is /out > 0. As a result the width of the tran- 
sition region and the allowed (average) level of out-of-band 
transmission need to be determined. Under the assumption 
that the bias is small, the relative errors in the PSF size and 
shape can be determined by taking the ratio of the in- and 
out-of-band contributions to the integrals in Equations [TB] 
and 1171 so that this is an upper limit to the impact of the 
out-of-band leakage. 

The most conservative approach is to assume no knowl- 
edge about the out-of-band PSF. The left panels in Fieure lTll 
shows how the PSF size and shape change for a flat spec- 
trum source (/(A) =constant) as a function of /out where we 
assumed oc A'^'^^. This is the case for the Euclid mission, 
but the results are relatively insensitive to the assumed slope 
of the spectral dependance. In practice, however, it is pos- 
sible to extrapolate the observed PSF to other wavelengths 
using a model to correct for the leakage more accurately. 
The top panels of Figure [T^] show the out-of-band 400nm 
PSF size and ellipticity as a function of the in-band 800nm 
values for theEuclid mission for different parts of the field 
of view and for different optical alignments. The residuals 
to linear fits are given in the lower panels, which indicates 
that biases in the 400nm size and ellipticity (per compo- 
nent) as predicted from the 800nm values are (in this case) 
smaller than = 0.015 and \Sei\ = 7.5 x 10"^ respec- 

tively (dashed lines in Figure [12)) . With such an analysis, 
a more relaxed out-of-band transmission requirement can 
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Figure 12. The correlation between the PSF sizes at 400nm and SOOnm (top left) and PSF eUipticity components (top right) as inferred 
from the Euclid model PSFs. The drawn lines are linear fits to the measurements, with the results shown in the lower panels. The dashed 
lines indicate reasonable maximum uncertainties. These can then be used to derive constraints on the out-of-band flux, and hence on the 
effect of the out-of-band transmission levels as shown in the right panel of Figure 1111 



be permitted. This is evident in the different x-axis scale 
for the right hand panels of Figure [TT] (calculated for these 
\&R^\ = 0.015 and \5ei\ = 7.5 x 10"^ values). The relaxation 
in the requirement is particularly true for the PSF size. 

4-4.3 Galaxy Colours 

In Sections 14.21 and 14.31 we described how the PSF can be 
constructed with high accuracy using observations of stars 
in the field-of-view for any position in the focal plane and 
for any colour of star. The PSF entries in Equations [6] and [8] 
refer to the PSF with which each galaxy has been convolved 
by the instrument. This requires an estimate of the galaxy 
spectral energy distribution (SED). A good estimate of the 
galaxy SED can be inferred from broad-band data, which 
are typically available in any case because of the need to 
determine photometric redshifts for the source galaxies. The 
actual SED, however, depends on the star formation history 
of the galaxy, its metallicity, redshift, etc. Hence it cannot 
be known perfectly, which inevitably leads to an error in the 
estimate of the PSF to be used for the galaxy PSF. 

The impact of uncertainties in the SED on the PSF 
size has been studied by Cypriano et al., (2010). Within 
the assumptions of Gaussian PSFs with fully sampled data, 
they find that the PSF size can be recovered with a relative 
uncertainty less than ~ 2 x 10""*. Spatially varying SEDs 
(spatial colour gradients within the galaxy) resulting from a 
broad bandpass have been shown to cause TVf-like biases at 
the level of ~ 5 x 10"* (see Voigt et al, 2012, Semboloni et 
al., 2013). This is comparable to the allocation in Table [T] 



5 SUMMARY 

We have set out with this paper to extend in a practical 
scheme the more general treatments of weak lensing mea- 



surements in the literature, and particularly that of MHK12. 
This can be used as a framework to define a next-generation 
space-based weak lensing experiment. 

We have started with the main requirements. It is neces- 
sary to observe a large enough number of galaxies through 
a wide-area survey with sufficient photometric sensitivity, 
range of redshift and spatial resolution to ensure that the 
parameters in different cosmological models can be tightly 
constrained (these parameters may be those in the standard 
Concordance Model, or from alternative models) . The size of 
the next-generations survey must be very large, some 15000 
square degrees, observed to jtiab > 24.5, in order to make 
available more than 10^ galaxies. With such a survey, the 
large intrinsic variation within the galaxy population can 
be averaged to produce very precise measurements of the 
cosmological parameters. However, with such precision, sys- 
tematic effects in the measurements potentially become the 
limiting factor. This leads to the other main requirement 
which is that the parameters used in the derivation of the 
shear information, principally the shape of the point spread 
function (PSF), are known with sufficient accuracy. These, 
together with the biases introduced by imperfect shear mea- 
surement methodologies, are constrained to be less than a 
small factor (a bias-over-error ratio of 0.31; MHK12) of 
the uncertainties arising from the finite size of the survey, 
thus ensuring that the cosmological parameters will be de- 
rived with the required accuracy. 

The main requirements are therefore survey size, depth, 
spatial resolution, the knowledge of the instrument charac- 
teristics and the extent to which the biases can be corrected. 
The first three are relatively conventional, although demand- 
ing: they drive the size of the telescope, the field of view, the 
detector pixel scale and noise levels, the survey duration, and 
so on. The remaining two constitute the different and partic- 
ularly challenging aspect of a weak lensing experiment. This 
requires a detailed cataloging of all of the potential effects 
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which affect our knowledge of the instrument, and particu- 
larly the PSF, the classification of these effects into different 
categories, and the appreciation of how and to what extent 
each will impact this knowledge. 

We have therefore examined these effects, first following 
MHK12 in considering additive and multiplicative biases A 
and M in separating out the linear and non-linear contri- 
butions to each, together with the biases introduced by the 
weight function in the modelling (a^ and fi terms in Equa- 
tions [S] and Linear contributions have been represented 
by convolutions, while non-linear effects (which generally 
arise in the detectors and electronics) are non-convolutive. 
We then examined the different scale of the contributions 
in each, and with the consequent weighting, combine them 
with a permitted error in our knowledge of the ellipticity 
and size of the system PSF to calculate the impact on the 
total A and M. These knowledge errors can be adjusted 
and balanced, based on feasibility considerations, to arrive 
at a set of permitted values. Then, we marshalled all of the 
individual contributing factors to these convolutive and non- 
convolutive effects into categories, such as those arising from 
imperfectly known source characteristics, satellite pointing 
errors, calibration residuals, PSF modelling errors, detec- 
tor imperfections (especially arising from radiation damage), 
and calculated their impact with example numerical values, 
again weighting these appropriately. We noted that their 
aggregate contributions must equal or be less than those al- 
located at the highest level, and again some adjustment and 
re-balancing may be required. An example of these factors 
was provided in Table [T] 

We then know what is required to achieve the scientific 
goals of the weak lensing survey. In order to have made the 
allocations in the contributions to the overall PSF knowl- 
edge budget, we have evaluated what is or may be feasible. 
We described briefly in Section [3] the ingredients incorpo- 
rated into the simulations. Because the systematic effects 
we are controlling have to be known very accurately, a deep 
understanding of the instrumental effects is required, from 
the range of variation of the telescope PSF, to the pointing 
characteristics of the satellite, and to subtle detector effects 
of various types. Having made the simulations, we then ex- 
plained the main steps in the processing of the simulated 
data. We find that standard processing will be adequate in 
the fiat-fielding and linearity corrections, while most of the 
other standard data processing procedures (bias subtrac- 
tion etc.) contribute second-order effects which don't feed 
directly into the ellipticity in most cases. The correction for 
the CCD CTI caused by radiation damage is the principal 
matter to be addressed at this stage. We noted the algorithm 
in table 1 of Massey et at, (2010) by which the trailing in the 
image can be largely corrected by linear combinations of the 
observed (or, at this stage, simulated) data with copies of 
these same data passed through a radiation damage model. 
At the end of this process, the best image data which can be 
generated using the calibrations and the radiation modelling 
is available for further analysis. 

We then examined whether the performance we can ob- 
tain for each constituent contribution remains reasonable. 
We check on the effect of the sampling, and conclude that 
with only three slightly displaced exposures, the slight un- 
dersampling at 0.688 Nyquist does not meet the stringent 
sampling requirement criteria, but not by a large amount. 



While it is not yet quantified how much these sampling cri- 
teria can be relaxed without more noticeably impacting the 
survey's weak lensing systematic error budget, and further 
work is needed, we found in Section 14.31 (where the three 
exposure case including CTI is propagated into the shear 
power spectrum) that such variation appears to have a lim- 
ited impact on the Dark Energy FoM. We continued with the 
investigation of the level of knowledge that can be reached 
in the PSF model. The aim here was not to identify the 
ultimate PSF model to be used, but to show that with an 
analysis of the modes of PSF variation combined with the 
Bayesian model fitting, the performance allocated in Equa- 
tion[S]can be achieved. Further modelling advances will pro- 
vide additional margin. We generated the eigenmode basis 
set for the Euclid case, over the full field of view, and over 
a range of optical system characteristics, arising from mis- 
alignments and manufacturing errors, finding that the num- 
ber of modes required is in the range 20 — 70. We finally 
examined how many stars would be required in order to 
retrieve the PSF from the Bayesian model fitting of the nor- 
mal modes to the simulated data to the accuracy allocated 
in Table [T] given a basis set with 40 components, and the 
actual pixellised, noisy PSFs. We found that for a reason- 
able field of view the PSF can be recovered on each field 
independently, without any reliance on the stability of the 
optical system from field to field. If such variations can be 
tracked, additional performance could be achieved. 

We finally examined the residuals in the data caused 
by the imperfect correction of the CTI caused by radiation 
damage. We found that the ultimate accuracy of the process 
is limited by the readout noise of the CCD and detection 
chain, as this adds uncertainties to the measurement of the 
charge trails. The lack of fidelity of the radiation model, and 
limited knowledge of the parameters within it contribute 
to the residuals, but because these are in the frame of the 
detector, with particular orientations, they can be minimised 
by iteration of the model. In addition we found that the 
effect of the imperfect CTI correction is limited to certain 
angular scales, of the order those subtended on the sky by 
the CCD. 

We ended with a brief reference to factors affecting the 
galaxy modelling itself, in particular the effects of imperfec- 
tions in the linearity correction, of spectral leakage outside 
of the defined bandpass and of the spatially variable SEDs 
within the galaxies. 
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